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PROTEASE VARIANTS 

FIELD OF THE INVENTION 

The present invention relates to novel trypsin-like 
protease variants with improved properties, DNA constructs 
5 coding for the expression of said variants, host cells capable 
of expressing the variants from the DNA constructs, as well as 
a method of producing the variants by cultivating said host 
cells. The variants may advantageously be used as constituents 
in detergent compositions and additives. 

10 BACKGROUND OF THE INVENTION 

Trypsin-like proteases, i.e. serine-proteases that in 
structure are similar to trypsin, have been extensively 
described in the literature (A J. Greer, "Comparative modelling 
methods - application to the family of mammalian serine 

15 proteases", Proteins, Vol. 7, p. 317-334, 1990; M. A. Phillips 
& R. J. Fletterick, "Proteases", Curr. Opin. Struct. Biol., 
Vol. 2, p. 713-720, 1992). 

The trypsin-like proteases can be divided into 
different families bases on their structure (A. Sali & T. 

20Blundell, "Definition of general topological equivalence in 
protein structures", J. Mol . Biol., 212, p. 403-428, 1990; J. 
P. Overington et al . , "Environment -specific amino acid substi- 
tution tables: Tertiary templates and prediction of protein 
folds", Protein Science, Vol. 1, p. 216-226). One such family 

25 of closely structurally related trypsin-like proteases com- 
prises eight mammalian and one bacterial trypsin-like pro- 
teases, i.e. the following nine proteases: Trypsin (Rattus 
rattus), PDB Code ltrm, S. Sprang et al . , "The three- 
dimensional structure of Asnl02 mutant of trypsin", Science, 

30Vol. 235, p. 905, 1987; Trypsin (Bos taurus) . PDB Code 2ptn, J. 
Walter et al., "On the disordered activation domain in 
trypsinogen" , Acta Cryst., Vol. 38B, p. 1462, 1982; Tonin 
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( Rat.tus rattus ) , PDB Code Iton, M. Fujinaga et al . , "Rat 
submaxillary gland serine protease, Tonin", J. Mol . Biol., Vol. 
195, p. 373, 1987; Kallikrein A ( Sus scrofa ) , PDB Code 2pka, W. 
Bode et al., "Refined 2 Angstroms X-ray crystal structure of 
Sporcine pancreatic Kallikrein A", J. Mol. Biol., Vol. 164, p. 
273, 1983; Gamma -chymo trypsin ( Bos taurus ) , PDB Code 2gch, G. 
H. Cohen et al., "Refined crystal structure of Gamma-chymo- 
trypsin at 1 . 9 Angstroms resolution", J. Mol. Biol., Vol. 148, 
p. 449, 1981; Pancreatic Elastase ( Sus scrofa ) , PDB Code 3est, 

10E. Meyer et al., PDB Code 3est, E. Meyer et al., "Structure of 
native porcine pancreatic elastase at 1.65 Angstroms resolu- 
tion'', Acta Cryst., Vol. 44B, p. 26, 1988; Neutrophil Elastase 
( Homo sapiens ) , PDB Code Ihne, M. A. Navia et al . , "Structure 
of Human Neutrophil Elastase in complex with a peptide chloro- 

15 methyl ketone inhibitor at 1.84 Angstroms resolution", Proc. 
Nat. Acad. Sci. USA, Vol. 86, p. 7, 1989; Mast Cell Proteinase 
( Rattus rattus ) , PDB Code 3rp2, S. J. Remington et al . , "The 
structure of Rat Mast Cell protease II at 1.9 Angstoms resol- 
ution", Biochemistry, Vol. 27, p. 8097, 1988. 

20 The PDB Codes refer to the structural data files 

deposited at the Brookhaven Protein Data Bank (F. C. Bernstein 
et al., "The Protein Data Bank: A computer based archival file 
for macromolecular structures", J. Mol. Biol., Vol. 112, p. 
535-542, 1977. 

25 WO 89/06270 discloses a trypsin-like protease obtained 

from a strain of the fungal species Fusarium oxvsporum as well 
as a detergent composition comprising the enzyme. No informa- 
tion as to the amino acid sequence of the enzyme or the three- 
dimensional structure thereof have been published. 

30 SUMMARY OF THE INVENTION 

The trypsin-like F. oxvsporum p rotease described in 
WO 89/06270 has now been cloned and sequenced and the three- 
dimensional structure thereof has been elucidated by X-ray 
crystallography. This trypsin- like protease has surprisingly 
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been found to belong to the above mentioned trypsin family 
comprising eight known mammalian and one bacterial trypsin-like 
protease. From a comparison with the structure of the trypsins 
belonging to this family it has surprisingly been shown that 
5 while the structures are very similar near the active and bin- 
ding sites and in the core of the protein, there are consi- 
derable differences in other regions, particularly in the loops 
on the surface of the molecule. Furthermore, there is no 
evidence of any divalent cation binding sites in the L. 
10 oxysporum trypsin-like protease. 

It is the object of the present invention to design 
novel variants of a trypsin-like Fusarium p rotease having 
improved properties as compared to those of the parent trypsin- 
like protease. 

15 Accordingly, in a first aspect the present invention 

relates to a variant of a parent trypsin-like Fusarium 
protease, which 

i) reacts with an antibody raised against or reactive 
with at least one epitope of the trypsin-like F . oxysporum 

20 protease comprising the amino acid sequence shown in the 
appended Sequence Listing ID No. 2, 

ii) is at least 60% homologous with the amino acid 
sequence of the trypsin-like F. oxysporum protease shown in SEQ 
ID No. 2, and/or 

25 iii) is encoded by a DNA sequence which hybridizes 

with an oligonucleotide probe hybridizing with a DNA sequence 
encoding the trypsin-like F . oxysporum protease having the 
amino acid sequence shown in SEQ ID No. 2. 

In the present context, the term "trypsin-like 

30 Fusarium protease" is intended to indicate a trypsin-like 
protease derived from a fungus of the genus Fusarium , and in 
particular of the species F. oxvsporum , or a functional 
analogue thereof. 

The term "functional analogue" is intended to indicate 

35 a trypsin-like protease which is immunologically cross-reactive 
with the trypsin-like F . oxvsporum protease described herein, 



WO 95/30743 



PCT/DK95/00180 



4 

comprises an amino acid sequence which is more than 60% 
homologous with that of the trypsin- like F. oxysporum protease 
shown in SEQ ID No. 2, such as more than 70%, 80% or even 90% 
with said protease, is encoded by a DNA sequence hybridizing 

5 with an oligonucleotide probe which also hybridizes with a DNA 
sequence encoding the trypsin- like F . oxvsporum protease the 
amino acid sequence of which is shown in SEQ ID No, 2, and/or 
has a three-dimensional structure having a core which is 
substantially similar to the core of the trypsin- like F\ 

10 oxvsporum protease and which preferably has one or more loop 
structures corresponding to loops II, IV, VI, IX, X and XIII of 
the trypsin-like F . oxvsporum protease described herein. The 
term "corresponding" as used about the loop structures is 
intended to indicate an identity of at least 60% such as 70%, 

15 80%, 90% or up to 100% with the corresponding F . oxvsporum 
protease loop structure (s) . The properties characterizing the 
functional analogue are intended to be understood in an 
analogous manner to properties i)-iii) listed above and further 
described below. 

20 For ease of reference, the following disclosure is 

based on the trypsin-like protease derived from the species F. 
oxvsporum (DSM 2672) , the cDNA and amino acid sequences of 
which are apparent from SEQ ID Nos. 1 and 2, respectively. It 
will be understood, however, that also functional, analogues of 

25 said protease as defined above, e.g. trypsin-like proteases 
derivable from other organisms such as microorganisms including 
bacterial and fungal strains, and in particular from other 
strains of Fusarium spp,, may be modified in a manner similar 
to that described for the trypsin-like F . oxvsporum protease 

30 described herein. Accordingly, variants of such functional 
analogous are intended to be considered to be within the scope 
of the present invention. Examples of other Fusarium strains, 
which have been found to produce trypsin-like proteases, are F. 
merismoides . F. redolens , F. sambucinum , F. solani and F\_ 

35 verticilloides . 



WO 95/30743 



PCI7DK95/00180 



5 

The term "variant" is intended to indicate a 
polypeptide which is derived from a trypsin- like Fusarium 
protease as defined above and which has one or more of the 
properties i) -iii) which will be further discussed below. Typi- 
Scally, the variant differ from the trypsin-like protease by one 
or more amino acid residues, which, for instance, may have been 
added or deleted from either or both of the N-terminal or C- 
terminal end of the protease, inserted or deleted at one or 
more sites within the amino acid sequence of the protease, or 

10 substituted with one or more amino acid residues within, or at 
either or both ends of the amino acid sequence of the protease. 

As stated above, the comparison of the three-dimen- 
sional structure of the trypsin-like Fusarium protease (shown 
in Fig. 1) with that of other known trypsins revealed a 

15 considerable difference in the surface structures of the 
proteases. It is contemplated that properties of other trypsin- 
like proteases may be improved when loop structures of the 
trypsin-like F. oxysporum p rotease disclosed herein are 
inserted in or substituted for loop structures of such trypsin- 

20 like proteases. 

Accordingly, in a further aspect the present invention 
relates to a variant of a parent trypsin-like protease compris- 
ing at least one of the loop structures of the trypsin-like 
Fusarium protease. 

25 In the present context the term "trypsin-like 

protease" is intended to indicate an enzyme having a three- 
dimensional structure similar to that of the class of trypsins 
listed in Table 2, below. It will be understood that trypsin as 
such is considered to be included within this definition. 

30 The variants of the present invention are contemplated 

to have improved substrate specificities, catalytic rate, 
stability, especially towards the action of proteolytic enzymes 
and/or detergent ingredients, thermostability, storage stabil- 
ity, improved resistance towards peroxidase /pHBS inactivation, 

35 and/or improved wash performance. 
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The present invention also relates to a DNA construct compris- 
ing a DNA sequence encoding a trypsin-like protease variant as 
indicated above, a recombinant expression vector carrying said 
DNA construct, a cell transformed with the DNA construct or the 

5 expression vector, as well as a method of producing a trypsin- 
like protease variant of the invention by culturing or growing 
said cell under conditions conducive to the production of the 
variant, after which the variant is recovered from the culture. 

The invention further relates to a enzyme granulate, 

10a liquid enzyme composition or a protected enzyme preparation 
comprising a trypsin-like protease variant of the invention and 
suitable for the preparation of e.g. a detergent composition 
comprising a trypsin-like protease variant of the invention. 

BRIEF DESCRIPTION OF DRAWINGS 

15 The present invention is further illustrated by 

reference to the accompanying drawings, in which: 

Fig. 1 shows the three-dimensional structure of the 
Fusarium trypsin-like protease; 

Fig. 2 shows the three-dimensional structure of the 
20mammalian trypsin-like proteases isolated from Trypsin (Rattus 
rattus) (ltrm) ; 

Fig. 3 shows the three-dimensional structure of 
Trypsin ( Bo taurus ) (2ptn) ; 

Fig. 4 shows the three-dimensional structure of Tonin 
25 ( Rattus rattus ) (lton) ; 

Fig. 5 shows the three-dimensional structure of 
Kallikrien A ( Sus scrofa ) (2pka) ; 

Fig. 6 shows the three-dimensional structure of y- 
chymotrypsin ( Bos taurus ) (2gch) ; 
30 Fig. 7 shows the three-dimensional structure of 

Pancreatic Elastase ( Sus scrofa ) (3est) ; 

Fig. 8 shows the three-dimensional structure of 
Neutrophil Elastase ( Homo sapiens ) (lhne) ; 
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Fig. 9 shows the three-dimensional structure of Mast 
Cell Proteinanse ( Rattus rattus ) (3rp2) ; and 

Fig. 10 shows the three-dimensional structure of the 
bacterial trypsin-like protease isolated from Trypsin 
5 ( Streptomyces griseus ) (Isgt) . 

DETAILED DESCRIPTION OF THE INVENTION 

In the present description and claims, the following 
abbreviations are used: 

Amino Acids 



10A = 


Ala 




Alanine 


V = 


Val 




Valine 


L = 


Leu 




Leucine 


I = 


He 




Isoleucine 


P = 


Pro 




Proline 


15 F = 


Phe 




Phenylalanine 


W = 


Trp 




Tryptophan 


M = 


Met 




Methionine 


G = 


Gly 




Glycine 


S = 


Ser 




Serine 


20T = 


Thr 




Threonine 


C = 


Cys 




Cysteine 


Y = 


Tyr 




Tyrosine 


N = 


Asn 




Asparagine 


Q = 


Gin 




Glut amine 


25 D = 


Asp 




Aspartic Acid 


E = 


Glu 




Glutamic Acid 


K = 


Lys 




Lysine 


R = 


Arg 




Arginine 


H = 


His 


ss 


Histidine 



30 Sequence Numbering 

In order to simplify the ensuing discussion a sequence 
numbering based on a structural alignment of the parent 
trypsin-like Fusarium protease with that of eight mammalian 
trypsins and one bacterial trypsin have been used. The rela- 

35tionship between the structural sequence numbering used and 
that of the amino acid sequence shown in SEQ ID No. 2 is 
apparent from Table 1, below. In Table 1 the residue numbering 
of y-chymotrypsin has been used as a reference. A listing of 
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the structural alignment of the ten trypsin like proteases is 
shown in Table 2, identified by PDB Codes. 



Table 1 

Reference- Amino acid of Sequential numbering 
5 numbering SEQ ID No, 2 of SEQ ID No. 2 



16 


I 


1 


17 


V 


2 


18 


G 


3 


19 


G 


4 


10 20 


T 


5 


21 


S 


6 


22 


A 


7 


23 


S 


8 


24 


A 


9 


15 25 


G 


10 


26 


D 


11 


27 


F 


12 


28 


P 


13 


29 


F 


14 


20 30 


I 


15 


31 


V 


16 


32 


S 


J. / 


33 


I 


1 Q 

lo 


34 


s 


1 Q 

iy 


25 35 


R 


20 


36 


N 


*3 1 


38 


(j 


Z Z 


"3 Q 


r* 
\j 




40 


P 


24 


30 41 


W 


25 


42 


c 


26 


43 


G 


27 


44 


G 


28 


45 


S 


29 


35 46 


L 


30 


47 


L 


31 


48 


N 


32 


49 


A 


33 


50 


N 


34 


40 51 


T 


35 


52 


V 


36 


53 


L 


37 


54 


T 


38 


55 


A 


39 


45 56 


A 


40 


57 


H 


41 


58 


C 


42 


59 


V 


43 


59a 


s 


44 
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59b 


G 


45 


59c 


Y 


46 


60 


A 


47 


61 


Q 


48 


5 62 


s 


49 


63 


G 


50 


64 


F 


51 


65 


Q 


52 


66 


I 


53 


10 67 


R 


54 


68 


A 


55 


69 


G 


56 


70 


S 


57 


71 


L 


58 


15 72 


s 


59 


73 


R 


60 


74 


T 


61 


78 


s 


62 


79 


G 


63 


20 8 0 


G 


64 


81 


I 


65 


82 


T 


66 


83 


s 


67 


84 


s 


68 


25 85 


L 


69 


86 


s 


70 


87 


s 


71 


88 


V 


72 


89 


R 


73 


30 90 


V 


74 


91 


H 


75 


92 


P 


76 


93 


s 


77 


94 


Y 


78 


35 9 5 


s 


79 


98 


G 


80 


99 


N 


81 


100 


N 


82 


101 


N 


83 


40102 


D 


84 


103 


L 


85 


104 


A 


86 


105 


I 


87 


106 


L 


88 


45 107 


K 


89 


108 


L 


90 


109 


S 


91 


110 


T 


92 


111 


S 


93 


50112 


I 


94 


113 


P 


95 


114 


S 


96 


115 


G 


97 


116 


G 


98 



r 
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117 


N 


99 


118 


I 


100 


119 


G 


101 


120 


Y 


102 


5 121 


A 


103 


122 


R 


104 


123 


L 


105 


124 


A 


106 


125 


A 


107 


10126 


S 


108 


127 


G 


109 


128 


S 


110 


129 


D 


111 


130 


P 


112 


15 131 


V 


113 


132 


A 


114 


133 


G 


115 


134 


S 


116 


135 


S 


117 


20136 


A 


118 


137 


T 


119 


138 


V 


120 


139 


A 


121 


140 


G 


122 


25141 


W 


123 


142 


G 


124 


143 


A 


125 


144 


T 


126 


145 


S 


127 


30146 


E 


128 


147 


G 


129 


148 


G 


130 


149 


S 


131 


150 


S 


132 


35151 


T 


133 


152 


P 


134 


153 


V 


135 


154 


N 


136 


155 


L 


137 


40156 


L 


138 


157 


K 


139 


158 


V 


140 


159 


T 


141 


160 


V 


142 


45161 


P 


143 


162 


I 


144 


163 


V 


145 


164 


s 


146 


165 


R 


147 


50166 


A 


148 


167 


T 


149 


168 


C 


150 


169 


R 


151 


170 


A 


152 
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171 


Q 


153 


172 


y 


154 


173 


G 


155 


174 


T 


156 


5 175 


S 


157 


176 


A 


158 


177 


I 


159 


177a 


T 


160 


178 


N 


161 


10179 


Q 


162 


180 


M 


163 


181 


F 


164 


182 


C 


165 


183 


A 


166 


15184 


G 


167 


185 


V 


168 


185b 


S 


169 


185c 


s 


170 


186 


G 


171 


20187 


G 


172 


188 


K 


173 


189 


D 


174 


190 


S 


175 


191 


C 


176 


25 192 


Q 


177 


193 


G 


178 


194 


D 


179 


195 


S 


180 


196 


G 


181 


30197 


G 


182 


198 


P 


183 


199 


I 


184 


200 


V 


185 


201 


D 


186 


35 202 


S 


187 


203 


S 


188 


207 


N 


189 


208 


T 


190 


209 


L 


191 


40 210 


I 


192 


211 


G 


193 


212 


A 


194 


213 


V 


195 


214 


S 


196 


45 2 1 5 


W 


197 


216 


G 


198 


217 


N 


199 


219 


G 


200 


220 


C 


201 


50221 


A 


202 


222 


R 


203 


223 


P 


204 


223a 


N 


205 


224 


Y 


206 
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225 


S 


207 


226 


G 


208 


227 


V 


209 


228 


Y 


210 


5229 


A 


211 


230 


S 


212 


231 


V 


213 


232 


G 


214 


233 


A 


215 


10 234 


L 


216 


235 


R 


217 


236 


S 


218 


237 


F 


219 


238 


I 


220 


15239 


D 


221 


240 


T 


222 


241 


Y 


223 


242 


A 


224 



Table 2 

20 16 20 30 37abc 40 50 59abc 

I ™ | ========= | ====== ! ===== ! ========= ! ======== | | 

lton IVGGYKCEKNSQPWQVAVIN E YLCGGVL IDPSWVI TAAHCY - - - 

2pka I IGGRECEKNSHPWQVAI YHY SSFQCGGVLWPKWVLTAAHCK- - - 

ltrm IVGGYTCQENSVPYQVSLNS GYHFCGGSLINDQWVVSAAHCY 

25 2ptn IVGGYTCGANTVPYQVSLNS GYHFCGGSLINSQWWSAAHCY- - - 

2gch IVNGEEAVPGSWPWQVSLQDKT- - - GFHFCGGSLINENWVVTAAH CG 

3est WGGTEAQRNSWPSQISLQYRSGSSWAHTCGGTLIRQNWVMTAAHCV 

lhne IVGGRRARPHAWPFMVSLQLR GGHFCGATLIAPNFVMSAAHCV- AN 

3rp2 1 1 GGVES I PHSRP YMAHLD I VTEKGLRVI CGGFL I SRQFVLTAAHCK - - - 

30 lsgt WGGTRAAQGEFPFMVRLS MGCGGALYAQDI VLTAAHCVSGS 

sp3 87 I VGGTS ASAGDFPF I VS I SRN GG PWCGGSLLNANTVLTAAHCVSGY 

16 20 30 37abc 40 50 59abc 



35 

lton 
2pka 
ltrm 
2ptn 

40 2gch 
3est 
lhne 
3rp2 
lsgt 

45 sp387 



62a 70 80 90 95 >ab ^ xy^ 100 

SN- -NYQVLLGRIWLFKDEPFAQRRLVRQSFRHPDYI PL - - IPVHDHSND 
ND- -NYEVWLGRHNLFENENTAQFFGVTADFPHPGFNLS- -ADGKDYSHD 

KS - -RIQVRLGEHNINVLEGNEQFVNAAKI IKHPNFD - -RKTLNNN 

KS- -GIQVRLGEDNINWEGNEQFISASKSIVHPSYN - -SNTLNND 

VTT-SDVWAGEFDQGSSSEKIQKLKIAKVFKNSKYN SLTINND 

DRELTFRWVGEHNLNQNNGTEQ YVGVQK I WHP YWN TDDVAAGYD 

VNVRAVRWLGAHNLSRREPTRQVFAVQRIFED- GYD PVNLLND 

GR- -EITVILGAHDVRKRESTQQKIKVEKQI IHESYN -SVPNLHD 

GNNTS ITATGGWDLQS - - GAAVKVRSTKVLQAPGYN - GTGKD 

AQS- GFQ I RAGSLSRT - - - SGGI TSSLSSVRVHPS YS 



-GNNND 



62a 



70 



80 



90 



95ab xy 100 



103 



110 



120 



130 



140 



147abl50 
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lton LMLLHLSEPADITGGVKVIDLPT- -KEPKVGSTCLASGWGSTNPS- -EMV 

2pka LMLLRLQS PAK I TDAVKVLEL P T - -QEPELGSTCEASGWGSIEPGPDDFE 

ltrm IMLIKLSSPVKLNARVATVALPS- -SCAPAGTQCLISGWGNTLSS- -GVN 

5 2ptn I ML I KLKS AAS LNSRVAS I S LPT - -SCASAGTQCLISGWGNTKSS- -GTS 

2gch ITLLKLSTAASFSQTVSAVCLPSASDDFAAGTTCVTTGWGLTRY 

3est I ALLRLAQSVTLNS YVQLGVLPRAGTI LANNSPCYI TGWGLTRTNG - - -Q 

Ihne I VI LQLNGS AT INANVQVAQLPAQGRRLGNGVQCLAMGWGLLGRNR - - -G 

3rp2 IMLLKLEKKVELTPAVNWPLPSPSDFIHPGAMCWAAGWGKTGVR DP 

10 lsgt WALIKLAQPIN QPTLKIAT- -TTAYNQGTFTVAGWGANREG GS 

sp3 8 7 LAILKLSTSI PSGGNIGYARLAASGSDPVAGSSATVAGWGATSEG- -GSS 



i 

i 

103 



•I — 
110 



120 



130 



_ i 

- 1 — - 

140 



i i i i 
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lton 
2pka 
ltrm 
2ptn 

20 2gch 
3est 
lhne 
3rp2 
lsgt 

25 sp387 



150 0160 169ab 177al80 185abc 190 194 

| j „_ j J j ======= | | | . | ==== ill! i -« i 

VSHDLQCVNI HLLSNEKC I - - ETYKDNVT - DVMLCAGEM - EGGKDTCAGD 
F PDE I QCVQLTLLQNTFCA - - DAHPDKVT - ESMLCAGYL - PGGKDTCMGD 
EPDLLQCLDAPLLPQADCE - - AS YPGKI T - DNMVCVGFL - EGGKDSCQGD 
YPD VLKCLKAP I LSDSS CK - - SAYPGQIT- SNMFCAGYL- EGGKDSCQGD 
TPDRLQQASLPLLSNTNCK- -KYWGTKIK-DAMICAGA- - -SGVSSCMGD 
LAQTLQQAYLPTVDYAICSSSSYWGSTVK-NSMVCAGG--DGVRSGCQGD 

I AS VLQELNVTW - TSLCR RSNVCTLVR - GRQAGVCFGD 

TSYTLREVELRIMDEKACV- -DYR- - YYEYKFQVCVGSP - TTLRAAFMGD 
QQRYLLKANVPFVSDAACR- - SAYGNT VANE E I CAGYPDTGGVDTCQGD 
TPVNLLKVTVP I VSRATCR- -AQYGTi rilTNQMFCAGV- SSGGKDSCQGD 



150 



======== | j | ======= 1 1 1 = | ===== j j j | ===== j === 



0160 



169ab 



177al80 185abc 190 194 



195 200 204a 210 218a 223a 230 240 

30 lton SGGPLICD- - - - - GVLQG I TSGGAT - PCAKPKTPAI YAKL I KFTS W I KKV 

2pka SGGPLICN GMWQGITSWGHT-PCGSANKPSIYTKLIFYLDWIDDT 

ltrm SGGPWCN GELQG I VSWGY - -GCALPDNPGVYTKVCNYVDWIQDT 

2ptn SGGPWCS GKLQGIVSWGS- -GCAQKNKPGVYTKVCNYVSWIKQT 

2gch SGGPLVCKKN- GAWTLVGI VSWGSS - TCSTS - TPGVYARVTALVNWVQQT 

35 3est SGGPLHCLVN-GQYAVHGWSFVSRLGCNVTRKPTVFTRVSAYISWINNV 

lhne SGSPLVCN GL I HG I AS FVRG - GCASGLYPDAFAP VAQFVNWI DS I 

3rp2 SGGPLLCA GVAHG I VS YGHP - DAK - - -PPAIFTRVSTYVPWINAV 

lsgt SGGPMFRKDNADEWIQVGIVSWGY- - GCARPGYPGVYTEVSTFASAI ASA 

sp387 SGGPIVDSS - - - -NTLIGAVSWGN- -GCARPNYSGVYA3VGALRSFIDTY 

195 200 204a 210 218a 223a 230 240 
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242 


Iton 


MKENP 


2pka 


ITENP 


5 ltrm 


IAAN- 


2ptn 


IASN- 


2gch 


LAAN- 


3est 


IASN- 


Ihne 


IQ- — 


10 3rp2 


IN 


lsgt 


ARTL- 


sp387 


A 




242 



15 In describing trypsin- like protease variants according 

to the invention, the following nomenclature is used for ease 
of reference: 

Original amino acid(s) : posit ion (s) : substituted amino acid(s) ; 
the position being indicated in accordance with the structural 
20 amino acid numbering apparent from Table 1. 

According to this nomenclature, for instance the 
substitution of glutamic acid for glycine in position 185b is 
shown as : 

Gly 185b Glu or G185bE 

25a deletion of glycine in the same position is shown as: 

Gly 185b * or G185b* 

and insertion of an additional amino acid residue such as 
lysine is shown as : 

Gly 185b GlyLys or G185bGK 
30 Where a specific trypsin- like protease contains a 

"deletion" in comparison with other trypsin- like proteases and 
an insertion is made in such a position this is indicated as: 

* 36 Asp or *36D 

for insertion of an aspartic acid in position 36 



35 Multiple mutations are separated by pluses, i.e.: 

Arg 170 Tyr + Gly 195 Glu or R170Y+G195E 
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representing mutations in positions 170 and 195 substituting 
tyrosine and glutamic acid for arginine and glycine, respec- 
tively. 

When one or more alternative amino acid residues may 
5 be inserted in a given position it is indicated as 
G185bE,D,S or 
G185bE or G185bD or G185bS 



Description of ths trypsin - like F. oxvsporum protease 

The following description of variants of the trypsin- 

10 like Fusarium protease of the invention is based on the parent 
trypsin- like protease derived from the strain of F. oxvsporum 
deposited with the Deutsche Sammlung von Mikroorganismen with 
the deposit number DSM 2672. It will be understood that 
functional analogues of said trypsin- like protease as defined 

15 above, e.g. other parent trypsin-like Fusarium proteases, may 
be modified in a similar manner to that described for the 
trypsin-like F. oxvsporum protease, e.g. by modifying similar 
positions (according to a structural alignment) . 

The parent trypsin-like p oxvsporum protease is 

20 encoded by the DNA sequence shown in the appended Sequence 
Listing ID No. 1, and the corresponding cDNA sequence an? amino 
acid sequence are shown in SEQ ID Nos. 2 and 3, respec -vely . 
The trypsin-like F - oxvsporum protease is expressed in the form 
of an inactive proenzyme also comprising a signal peptide. The 

25 mature, active enzyme consists of 224 amino acid residues and 
has a molecular weight of 22190. The signal peptide is contem- 
plated to be amino acid residues -24 to -8 (according to the 
rules of von Heijne (1986) and the pro-peptide amino acid 

residues -7 to -1. 
30 The trypsin-like f. oxvsporum protease shows a 

reversed Arg/Lys specificity compared to that of bovine 

trypsin, which means that the trypsin-like F\ — oxysporum 

protease is more Arg-active than Lys-active. 

The three-dimensional structure of the Fusarium 
35 trypsin-like protease is shown in Fig. 1 and that of the 
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mammalian trypsin-like proteases isolated from Trypsin (Rattus 
rattus ) (ltrm), Trypsin ( Bo taurus ) (2ptn) , Tonin ( Rattus 
rattus ) (lton) , Kallikrien A (Sus_scrofa) (2pka) , y- 
chymotrypsin ( Bos taurus ) (2gch) , Pancreatic Elastase (Sus 
Sscrofa) (3est) , Neutrophil Elastase (Homo sapiens) (lhne) and 
Mast Cell Proteinanse ( Rattus rattus ) (3rp2) in Figs. 2-9, 
respectively and that of the bacterial trypsin-like protease 
isolated from Trypsin ( Streptomyces ariseus ) (lsgt) in Fig. 10. 

As mentioned above considerable differences in the 
10 loop structures of these trypsin-like proteases have been 
observed. The loop structures of the F - oxvsporum trypsin-like 
protease have been identified and are listed in Table 3. 
Corresponding loop structures may be identified in other 
trypsin-like proteases, including the ones illustrated in Figs. 
15 2-10. The amino acid segments listed in this table refer to the 
positions in the amino acid sequence of the trypsin-like 
Fusarium protease listed in SEQ ID No. 2. The loops identified 
in the table may have different backbone conformations. 
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Table 3 





Loop 


Segments 


Amino acid seq. 




I 


23-27 


SAGDF 




II 


34-40 


RNGG 


5 


III 


47-50 


LNAN 




IV 


59-64 


VSGYAQSGF 




V 


70-80 


SLSRTSGG 




VI 


91-99 


HPSYSGN 




VII 


110-118 


TSIPSGGNI 


10 


VIII 


125-133 


ASGSDPVA 




IX 


145-151 


SEGGSST 




X 


163-179 


SRATCRAQ YGTS AI TNQ 




XI 


185-187 


GVSSGG 




XII 


201-208 


DSSNT 


15 


XIII 


215-225 


WGNGCARPNYS 



The intervening segments which are not mentioned in 

the above listing form the core of the proteins, and each of 

these segments have similar backbone conformations for all of 
the trypsin-like proteases listed in Table 2. 

20 Description of variants of the invention 

As explained above the variant of the trypsin-like 
protease of the invention has one or more characteristic 
properties, some of which will be explained in detail in the 
following. 

25 Property i) of the variant of the trypsin-like 

protease, i.e. the immunological cross reactivity, may be 
assayed using an antibody raised against or reactive with at 
least one epitope of the trypsin-like protease comprising the 
amino acid sequence shown in SEQ ID No. 2. The antibody, which 

30 may either be monoclonal or polyclonal, may be produced by 
methods known in the art, e.g. as described by Hudson et al., 
1989. The immunological cross -reactivity may be determined 
using assays known in the art, examples of which are Western 
Blotting or radial immunodiffusion assay, e.g. as described by 

35 Hudson et al . , 1989. 

Property ii) of the variant of the trypsin-like 
protease of the invention, i.e. the homology between the amino 
acid sequence of the variant and the amino acid sequence shown 
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in SEQ ID No. 2 is intended to indicate the degree of identity 
between the two sequences indicating a derivation of the first 
sequence from the second. In particular, a polypeptide is 
considered to be homologous to the trypsin- like protease if a 

5 comparison of the respective amino acid sequences reveals an 
identity of greater than about 60%, such as above 70%, 80% or 
85%. It will be understood that variants in which a high number 
of amino acid residues have been modified have a lower degree 
of identity, whereas variants comprising only few amino acid 

lOmodifications will show identities of more than 90%, such as 
more than 95% or even 99% identity. Sequence comparisons can be 
performed via known algorithms, such as the one described by 
Lipman and Pearson (1985) . 

The oligonucleotide probe used in the characterization 

15 of the variants of the invention in accordance with property 
iii) defined above, may suitably be prepared on the basis of 
the full or partial nucleotide or amino acid sequence shown in 
SEQ ID No. 1 and 2, encoding or constituting, respectively, the 
trypsin- like F - oxvsporum protease described herein. The 

20 hybridization may be carried out under any suitable conditions 
allowing the DNA sequences to hybridize. For instance, such 
conditions are hybridization under specified conditions, e.g. 
involving presoaking in 5xSSC and prehybridizing for lh at 
-40°C in a solution of 20% formamide, 5xDenhardt's solution, 

25 50mM sodium phosphate, pH 6.8, and 50/xg of denatured sonicated 
calf thymus DNA, followed by hybridization in the same solution 
supplemented with 100/xM ATP for I8h at ~40°C, or other methods 
described by e.g. Sambrook et al., 1989. 

In the following specific classes of trypsin- like 

30 Pusarium protease variants of the invention having improved 
properties are described as well as the concepts used for the 
design of such variants. 



Proline stabilization 

In order to improve the stability of a trypsin- like 
35 Fusarium protease, such as the thermal, storage and/or protease 
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stability thereof, it may be advantageous to introduce one or 
more proline residues therein. The improvement obtained by such 
modification may be explained on the basis of the structure of 
proteins. The modification of subtilisins by introduction of 

5 proline residues therein are described in WO 92/19729, incor- 
porated by reference herein. 

Proteases are globular proteins and quite compact due 
to the considerable amount of folding of the long polypeptide 
chain. The polypeptide chain essentially consists of the "bac- 

lOkbone" and its "side -groups" . As the peptide bond is planar, 
only rotations around the C a -N axis and the C ft -C axis are per- 
mitted. Rotation around the C ft -N bond of the peptide backbone 
is denoted by the torsion angle <t> (phi), rotation around the 
C a -C bond by i|r (psi) [vide e.g. Creiahton, TVE^ (1984); 

15 Proteins; W.H. Freeman and Company, New York] . The choice of 
the values of these angles of rotation is made by assigning the 
maximum value of +180° (which is identical to -180°) to the 
maximally extended chain. In the fully extended polypeptide 
chain, the N, C tt and C atoms are all "trans" to each other. In 

20 the "cis" configuration, the angles tf> and $ are assigned the 
value of 0°. Rotation from this position around the bonds so 
that the atoms viewed behind the rotated bond move "counter- 
clockwise" is assigned negative values by definition, those 
"clockwise" are assigned positive values. Thus, the values of 

25 the torsion angles lie within the range -180° to +180°. 

Since the C - atoms are the swivel point for the chain, 
the side-groups (R-groups) associated with the C Q -atoms become 
extremely important with respect to the conformation of the 
molecule. The conformation, which defines the participation of 

30 the secondary and tertiary structures of the polypeptide chains 
in moulding the overall structure of the protein, is of prime 
importance to the specific structure of the protein and 
contributes greatly to the unique catalytic properties (i.e. 
activity and specificity) of the enzyme and its stability. 

35 Proline residues have a reduced degree of rotational 

freedom around the N-C a bond compared to other types of amino 
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acids, because the proline sidechain connects back to the amide 
nitrogen. This connectivity usually restricts the 0-angles of 
proline residues to a narrow interval around -60°. 

The equilibrium between the unfolded and folded state 

5 of a protein is to a large extent governed by the entropy 
difference between the two states, and consequently it is 
envisaged that the trypsin-like Fusarium protease can be 
stabilized by reducing the number of different conformations 
that are accessible in the unfolded state. Introduction of 

10 proline residues for other residues in the protein sequence 
generally reduces the entropy of the unfolded state, due to the 
restricted rotational freedom for proline residues. For such 
substitutions to have effect on the stability of the protein, 
they must be compatible with the structure of the protein in 

15 the folded state, that is, the substituted residues must have 
<£-angles in the folded state that are in the allowed interval 
for prolines, and the introduced prolines must not cause an 
energetically unfavourable packing of the protein atoms. 

In accordance with the above theoretical explanation 

20 trypsin- like Fusarium protease variants are contemplated, in 
which a naturally occurring amino acid residue (other than 
proline) of the amino acid sequence of the parent trypsin-like 
protease has been substituted with a proline residue at one or 
more positions, at which positions (s) the dihedral angles <f> 

25 (phi) and i|r (psi) constitute values within the intervals [- 
90°<tf><-40° and -180°<t|j<180 o ] , preferably within the intervals 
[-90°«£<-40° and 120°<i|r<180 o ] or [-90°«£<-40° and -50 0 <t|i<10 0 ] , 
and which position (s) is/are not located in regions in which 
the protease is characterized by possessing or-helical or S- 

30 sheet structure. 

The stabilized trypsin-like Fusarium p rotease variants 
according to this embodiment of the invention may be prepared 
by subjecting the trypsin-like Fusarium p rotease to analysis 
for secondary structure, identifying residues in the protease 

35 having dihedral angles <f> (phi) and $ (psi) confined to the 
intervals [-90°«/><-40° and -180°<^<180°] , preferably the inter- 
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vals [-90 o «£<-40° and 120°<ilr<180°] or [-90 o «£<-40° and - 
50°<i|k10°] , excluding residues located in regions in which the 
trypsin- like Fusarium p rotease is characterized by possessing 
a-helical or fc-sheet structure, if a proline residue is not 
5 already at the identified position (s) , substitution of the 
naturally occurring amino acid residue with a proline residue 
at the identified position (s) , preferably by site directed 
mutagenesis of a gene encoding the trypsin- like Fusarium p ro- 
tease, and subsequent expression and recovery of the resulting 
10 trypsin- like Fusarium protease variant as described in detail 
below. 

The analysis of the parent trypsin- like Fusarium 
protease for secondary structure may be performed on the basis 
of the atomic structure of the protease, which, e.g. may be 
15 determined by X-ray diffraction techniques. X-ray diffraction 
techniques are described by e.g. Hendrickson, W.A [X-ray 
diffraction; in Protein Engineering (Ed: Oxender, D.L and Fox, 
C.F.), ch. 1; Alan R. Liss, Inc. (1987)] and Creighton, T.E. , 
supra , ch. 6. 

20 When the atomic structure has been determined, it is 

possible to compute dihedral angles from the atomic coordi- 
nates. Moreover, it is possible to assign secondary structure 
elements. The secondary structure elements are defined on the 
basis of hydrogen bindings. Cooperative secondary structure is 

25 recognized as repeats of the elementary hydrogen-bonding pat- 
terns "turn" and "bridge". Repeating turns are "helices", 
repeating bridges are "ladders", connected ladders are 
"sheets" . 

A/alysis for secondary structure elements requires a 
30 computerized compilation of structure assignments and geometri- 
cal features extracted from atomic coordinates . The conven- 
tional method to elucidate the secondary structure of a pro- 
tein, based on its atomic coordinates, is described by Kabsch, 
W. and Sander, C. [Biopolymers (1983) 22 2577-2637] . In this 
35 article an algorithm for extracting structural features from 
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the atomic coordinates by a pattern-recognition process is 
provided. First, H-bonds are identified based on electrostatic 
interactions between pairs of H-bonding groups. Next, the 
patterns of H-bonding are used to define secondary structure 
5 elements such as turns (T) , bends (S) , bridges (B) , helices 
(G,H, I), fc-ladders (E) and £- sheets (E) . 

A computer program DSSP (Define Secondary Structure 
of Proteins) , enabling the computation of Kabsch & Sander files 
and written in standard PASCAL, is available from the Protein 

lOData Bank, Chemistry Dept., Brookhaven National Laboratory, 
Upton, N.Y. 11973. 

After the dihedral angles 4> (phi) and i|> (psi) for the 
amino acids have been calculated, based on the atomic structure 
in the crystalline proteases, it is possible to select posi- 

15tion(s) which has/have dihedral phi and psi angles favourable 
for substitution with a proline residue. The aliphatic side 
chain of proline residues is bonded covalently to the nitrogen 
atom of the peptide group. The resulting cyclic f ive-membered 
ring consequently imposes a rigid constraint on the rotation 

20 about the N-C a bond of the peptide backbone and simultaneously 
prevents the formation of hydrogen bonding to the backbone N- 
atom. For these structural reasons, prolines are generally not 
compatible with a-helical and £-sheet secondary conformations. 
Due to the same rotational constraint about the C a -N bond, and 

25 due to the requirement that neighbouring amino acids in the 
chain are not perturbed, the magnitudes of the dihedral angles 
phi and psi (and in particular phi) are confined to limited 
intervals for proline residues in polypeptides. The dihedral 
angles for proline residues in polypeptides are almost exclu- 

30sively within the intervals [-9O°<0<-4O° and -180°<t|r<180 o ] , 
preferably the intervals [-90°«£<-40° and 120°<i|i<180 o ] or [- 
9O°<0<-4O° and -50°<iJr<10 o ] . In this context, both cis- and 
trans -proline residues are considered. 

For the reasons stated above it is preferred that the 

35 amino acid residue (s) to be substituted with proline is a 
hydrophilic or a small hydrophobic amino acid residue, in 
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particular one selected from the amino acid residues A, D, E, 

K, G, Q,. R, S, T # N and V. 

Studies of the three-dimensional structure of the 

trypsin- like p. oxvsporutn p rotease have revealed the following 
5 positions occupied by amino acid residues, of which one or more 

may advantagously be substituted with a proline residue: 21, 

24, 30, 46, 49, 71, 90, 111, 114, 121, 124, 125, 126, 132, 135, 

146, 148, 174, 175, 176, 178, 185b, 185c, 202, 208, and 209 

(using the amino acid numbering defined in Table 1, above) . It 
10 is believed that substitution for proline in one or more of 

these positions may result in trypsin-like Fusarium protease 

variants with an improved stability. 

Specific variants of the trypsin-like F . oxysporum 

protease comprise one or more of the following substitutions: 
15A24P, A49P, V90P, SHIP, A124P, A125P, S126P, A132P, S135P, 

T174P, S175P, S185bP, S185cP, S202P. 

Stabilization bv introduction of a disulphide -bridge 

Stabilization of a given protein, especially as 
concerns thermostabilization, may be achieved by covalently 

20 binding two regions in the protein that are far apart in 
sequence but close in space. Such binding may be performed by 
the introduction of a disulphide-bridge in the protein, i.e. by 
introducing one or more cysteine residues capable of binding to 
each other or to other cysteine residues present in the 

25 protein . 

In accordance with this embodiment, the invention 
relates to a variant of a trypsin-like Fusarium protease, in 
which an amino acid residue different from cysteine of the 
amino acid sequence of the parent trypsin-like protease has 

30 been substituted with a cysteine residue in such a manner that 
the introduced cysteine residue together with another cysteine 
residue present in the parent protease or introduced therein 
form a disulphide bridge. 

Positions in which SS -bridges may be introduced may 

35 be identified by comparing the structure of the trypsin-like 
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Fusarium protease with the structures of the homologous 
trypsins listed in Table 2. By such comparison the following 
residue pair positions have been identified, between which a 
disulphide bridge can be introduced: 
522+157, 129+232, 136+201. 

Specific variants of the trypsin- like F . oxvsporum 
protease include: 

(i) A22C+D26S+K157C, 

(ii) A136C+D201C, and 

10 (iii) A124P+A125T+S126*+G127*+D129C+P13 0A+G232C 

In (i) the substitution D26S is performed to allow for 
the introduction of Cys in position 157 due to a salt bridge 
between D26 and K157. 

In (iii) due account has been taken of positions 124 
15 to 128 by shortening loop no. VII. 

Stabilization bv modification of Asn-Glv Pairs 

It is known that at alkaline pH, the sidechain of Asn 
may interact with the NH group of a sequential neighbouring 
amino acid to form an isoAsp residue where the backbone goes 

20 through the Asp sidechain. This will leave the backbone more 
vulnerable to proteolysis. The deamidation is much more likely 
to occur if the residue that follows is a Gly. Changing the Asn 
in front of the Gly or the Gly will prevent this from happening 
and thus improve the stability, especially as concerns thermo- 

25 and storage stability. 

The invention consequently further relates to a 
trypsin- like Fusarium protease variant, in which either or both 
residues of any of the Asn-Gly sequence appearing in the amino 
acid sequence of the parent trypsin- like protease is/are 
30 deleted or substituted with a residue of a different amino 
acid. 
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The Asn and/or Gly residue may, for instance, be 
substituted with a residue of an amino acid selected from the 
group consisting of A, Q, S, P, T and Y. 

More specifically, any of the Asn or Gly residues of 
5 the Asn-Gly occupying positions 36+38 and/or 217+219 of the 
parent trypsin-like protease (using the amino acid numbering 
defined in Table 1, above) may be deleted or substituted with 
a residue of an amino acid selected from the group consisting 
of A, Q, S, P, T and Y. In this connection it should be noted 
10 that there are jumps in the sequence numbering of trypsin-like 
Fusarium protease, and the two positions mentioned there are 
therefore sequential . 

Specific variants of SP387 are: 
N36S, G38S, N217S,Y, G219S 

15 Introduction of protease resistant loops 

In a further embodiment the present invention relates 
to a trypsin-like Fusarium protease variant, in which one or 
more amino acid residues present in or constituting a loop 
structure of the parent trypsin-like protease susceptible to 

20 cleavage by a proteolytic enzyme is/are deleted or replaced 
with one or more amino acid residues so as to obtain a modified 
loop structure having an improved proteolytic stability. 

It is of particular interest to modify loop structures of a 
trypsin-like Fusarium protease which are susceptible to 

25 cleavage by a subtilisin or trypsin. 

Thus, subtilisin and/or trypsin-resistant loops may 
be identified in other proteins and substituted for loop(s) of 
a parent trypsin-like Fusarium protease having a lower 
proteolytic stability. The substitution may be performed either 

30 by substituting the entire loop(s) or by substitution one or 
more amino acid residues of a loop of the parent trypsin-like 
Fusarium protease so as to obtain a modified loop having a 
higher protease stability than the original loop. The amino 
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acid residues to be substituted may be identified by comparing 
the amino acid sequence of the loop of the parent trypsin-like 
Fusarium protease with that of the "foreign" more protease - 
resistant loop. 

5 More specifically, trypsin-like Fusarium protease variants 
according to this embodiment comprises variants in which one or 
more amino acid substitutions is/are made so that the modified 
loop structure comprises at least one of the amino acid 
segments GAAG, GARG, YPGS, YPRS, HNRG, YTGN, ISSE, NNAG, SFIN, 

10 DQNG, ASFS, SRGV, LDTG, YYAA, INDI , WYFG, SIEN, GST Y , DSTN, 
PDLR, LDTG, GNRY, SGVM, RYPS, NGLV, SFSI, LGSP, RASF, VPWG, 
PDLN, SFVP, PDYR, PRLP, TVLP, IGTC, TGGT, TNKL, VGDV, IGVL, 
GSTY, RYAN, PNIP, or TLVP are contemplated to have an improved 
proteolytic stability. 

15 Other specific variants according to this embodiment 

of the invention include a variant, in which loop II of the 
trypsin-like protease comprising the peptide sequence SRNGGP is 
substituted with loop II of the trypsin 2ptn isolated from Bos 
taurus comprising the peptide sequence NSGYH, as follows: 

20 S3 4N+R3 5S+N3 6 * +G3 9Y+P4 OH , and/or 

loop IV of the trypsin-like protease comprising the 
peptide sequence VSGYAQSGF is substituted with loop IV of the 
trypsin 2ptn isolated from Bos taurus comprising the peptide 
sequence YKSGI , as follows: 

25V59Y+S59a*+G59b*+Y59c*+A60K+Q61S+S62*+F64I, and/or 

loop IV of the trypsin-like protease comprising the 
peptide sequence VSGYAQSGF is substituted with loop IV of the 
trypsin ltrm isolated from Rattus rattus comprising the peptide 
sequence YKSRI , as follows: 



30V59Y+S59a*+G59b*+Y59c*+A60K+Q61S+S62*+G63R+F64I, and/or 
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loop VI of the trypsin- like protease comprising the 
peptide sequence HPSYSGN is substituted with loop VI of the 
trypsin ltrm isolated from Rattus rattus comprising the peptide 
sequence HPNFDRKTL, as follows: 

5S93N+F94Y+S95D+*96R+*97K+G98T+N99L. 

Removal of autoproteolysis sites 

According to a further aspect of the invention 
autoproteolysis sites may be removed by changing the amino 
acids at the autoproteolysis site. Since the trypsin-like F\. 

10 oxysporum p rotease cleaves at Lys and Arg residues it is 
preferred to modify such residues of a parent trypsin-like 
Fusarium protease having the same or a similar specificity, 
preferably by substituting with a non or less positively 
charged amino acid residue. The non or less positively charged 

15 amino acid residue may be selected from the group consisting of 
K, S, V, P, E, D, N, Q, A and G; the amino acid residues K, S, 
V, or P being particularly preferred. 

Since the parent trypsin-like F. oxvsporum protease 
is specific mostly towards Arg and to a minor extend towards 

20 Lys residues, the modification of this parent trypsin-like 
protease may preferably be made by changing Arg to another 
amino acid residue (including Lys) or by changing Arg or Lys to 
a non or less positively charged amino acid as defined above. 

The following residue positions of the trypsin-like 

25 F . oxysporum protease have been found to contain Lys or Arg 
residues: 

35, 67, 73, 89, 107, 122, 157, 165, 169, 188, 222, 235, using 
the structural sequence numbering defined in Table 1. 

Of these potential autoproteolysis sites, residues 35, 
30 1 22 and 169 have been found to be primary autoproteolysis 
sites, and positions 73 and 89 have been found to be secondary 
autoproteolysis sites. Examples of specific modifications are: 
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R35K,S 
R122K,V,P 
R169K, V 

R122K,V,P+A124P 
5R122V,P+A124P+T208V. 

Alternatively autoproteolysis can be prevented by 
changing the amino acid residue occupying the position follow- 
ing the Arg or Lys residue in question to Pro. For instance, 
this may be done in the positions 90 and/or 123 (according to 
lOthe structural amino acid numbering defined in Table 1), as 
follows : 

V90P, L123P. 

Removal of critical oxidation sites 

In order to increase the stability of the trypsin-like 

15 Fusarium p rotease it may be advantageous to substitute critical 
oxidation sites, such as methionines, with other amino acid 
residues which are not subject to oxidation. 

Accordingly, in a further embodiment the present 
invention relates to a trypsin-like Fusarium protease variant, 

20 in which one or more amino acid residues susceptible to 
oxidation is/are replaced with another amino acid residue less 
susceptible to oxidation. The amino acid residue less suscep- 
tible to oxidation may for instance be selected from the group 
consisting of A, E, N, Q, I, L, S and K. 

25 Specific variants of the trypsin-like F_. oxysporum 

protease comprises one of the following substitutions: 

M180N,Q,E,K, using the sequence numbering of Table 1. 

Modification of tryptophan residues 

In order to stabilize the protein it may be advantage- 
30ous to replace or delete tryptophan residues at the surface of 
the protein, e.g. as described in US 5,118,623. The tryptophan 
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residues may advantageously be substituted for F, T, Q or G. 
Thus, in a further embodiment the invention relates to a 
trypsin- like F, oxvsnorum v ariant comprising one or more of the 
following substitutions: 
5W41F,T,Q,G 
W215F,T,Q,G 

Introduction of Glvcosvlation Sites 

The concept is to introduce N-glycosylation sites in 
loops subject to proteolysis in general and autoproteolysis in 
10 particular. 

More specifically, in accordance with this embodiment 
the invention relates to a trypsin-like Fusarium protease 
variant, in which a N-glycosylation site has been introduced at 
an amino acid residue X located in a loop structure on the 

15 surface of the protein subject to proteolysis by changing the 
sequence segment X-Y-Z to Asn-Yl-Ser or Asn-Yl-Thr, provided 
that Yl is different from Gly, so as to confer to the variant 
an improved proteolysis resistance. 

The introduction of a N-glycosylation site at residue 

20 X will make the segment target for proteins that N-glycosylate 
the amide nitrogen of the Asn residue introduced. The residues 
X, Y, Yl, Z can be any residue, except Yl should not be Gly, 
because this would create an Asn-Gly sequence the introduction 
of which may result in a less stable variant. For Asn in the 

25 changed sequence segment to become glycosylated, the changed 
sequence segment must lie on the protein surface such that it 
can be recognized by glycosylaticn proteins. 

In the parent trypsin-like F . oxvsporum protease, a 
N-glycosylation site can be introduced at the following 

30 posit ions: 

17, 18, 20, 21, 23, 24, 25, 34, 35, 36, 38, 39, 48, 49, 50, 
59a, 59b, 59c, 60, 61, 62, 63, 69, 70, 71, 72, 73, 74, 78, 79, 
80, 81, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 98, 99, 109, 
110, 111, 113, 114, 115, 116, 117, 118, 121, 123, 124, 125, 
35126, 127, 128, 129, 130, 131, 132, 133, 134, 143, 144, 145, 
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146, 147, 148, 149, 150, 151, 152, 153, 154, 163, 164, 165, 
173, 174, 175, 176, 177a, 178, 184, 185, 185b, 187, 188, 201, 
202, 203, 207, 215, 216, 217, 221, 222, 223, 224, 225, 233, 
234, 235 and/or 236. The numbers are given in accordance with 
5 the amino acid numbering defined in Table 1 and refer to the 
starting position of the target segment. Thus, for instance, 
position "17" indicates the segment 17-18-19, "18" the segment 
18-19-20 etc. 

The technique may specifically be applied to primary 

lOautoproteolysis sites containing Lys or Arg residues, that is 
cases where Y = Lys or Arg. In cases where the Lys or Arg is 
considered important for the functioning of the protein, for 
instance where a reduction in pi is undesirable, the auto-- 
proteolysis sites cannot be removed by substituting Lys and Arg 

15 to other residues where the parent trypsin-like Fusarium 
protease has low specificity. By introducing N-glycosylation 
sites on residues that are sequential neighbours to exposed Lys 
and Arg residues one may reduce the binding affinity of parent 
trypsin-like Fusarium protease for the auto-proteolysis site, 

20 and thereby prevent auto-proteolysis, while maintaining the Lys 
or Arg residues. That is Y=Yl=Lys or Arg. 

In this connection, introduction of a N-glycosylation 
site by substitution of an amino acid residue of the parent 
trypsin-like F . oxvsporum protease occupying any of the 

25positions 34, 36, 72, 74, 88, 90, 121, 123, 164, 170, 187 
(using the sequence numbering defined in Table 1) may be advan- 
tageous . 

Examples of trypsin-like F . oxvsporum variants 

comprises one or more of the following substitutions: 
30 S34N+R35R, K+N36S, T 

N36N+G38S+G39S,T 

S72N+R73R, K+T74T, S 

V8 8N+R8 9R , K+V9 0 S , T 

A121N+R122R,K+L123S,T 
35 LI 2 3N+ A2 2 4 A+ A2 2 5 S , T 

S164N+R165R,K+A166S,T 
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Variants with improved wash performance 

The ability of an enzyme to catalyse the degradation 
of various naturally occurring substrates present on the ob- 
jects to be cleaned during e.g. wash is often referred to as 
5 its washing ability, washability, detergency, or wash perform- 
ance. The present invention devices trypsin-like Fusarium 
protease variants having an improved wash " performance as 
compared to that of the parent trypsin-like protease. 

Variants with improved stability against inactivation bv 

10 peroxidase or pHBS 

It is possible that many, if not all tyrosines are 
chemically modifed by peroxidases and/or pHBS (p-hydroxy 
benzene sulfonate) . This kind of modification may cause a 
dramatic decrease in the isoelectric point of trypsin-like 

15 Fusarium proteases. To prevent such modification Tyr residues 
may be modified to other amino acids, preferably a hydrophobic 
amino acid when the Tyr is burried in the interior of the 
protein or a hydrophilic amino acid when the Tyr is exposed on 
the protein surface. 

20 In the parent trypsin-like F . oxysporum protease, the 

following tyrosine residues may be modified: 
59c, 94, 120, 172, 224, 228, 241. 

Examples of specific trypsin-like F . oxysporum 
variants comprises on or more of the following substitutions: 
25 Y59cN,Q,S,A,F 
Y94F,W 

Y120T,F,S,A,V 
Y172F,W,H 
Y224T,S,K,N,D,F 
30Y228F 

Y241T,S,V,F. 
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Variants with raised/lowered pi 

The concept is to alter the pi for the protein such 
that it approaches the pH of the detergent formulation. The pi 
can be raised by changing negatively charged or neutral amino 
5 acids to positively charged amino acids or by changing posi- 
tively charged residues to more positively charged residues. 
The pi can be lowered by changing positively charged or neural 
amino acids to negatively charged amino acids or by changing 
negatively charged amino acids to more negatively charged amino 
10 acids . 

Accordingly, in accordance with this embodiment the invention 
relates to a trypsin-like Fusarium protease variant, in which 
the net electrostatic charge of the parent trypsin-like protea- 
se has been changed by deleting or substituting one or more ne- 

lSgatively charged amino acid residues by neutral or positively 
charged amino acid residue (s), and/or by substituting one or 
more neutral amino acid residues by positively or negatively 
charged amino acid residue (s) , and/or by deleting or substi- 
tuting one or more positively charged amino acid residue (s) by 

20neutral or negatively charged amino acid residue (s), thereby 
obtaining variant which either has a lower or higher pi as 
compared to the pi of its parent protease. 

In order to have any effect on the pi, the positions 
suited for substitution should be located on the protein 

25 surface. It is preferred that the amino acid substitutions 
result in a variant protease having a pi just below the pH of 
the detergent. 

In particular, an amino acid residue located in one 
or more positions of the parent trypsin-like Fusarium protease 

30 and exposed at the surface of the molecule may be substituted: 
17, 18, 20, 21, 23, 24, 25, 26, 27, 34, 35, 36, 38, 39, 40, 41, 
48, 49, 50, 59a, 59b, 59c, 60, 61, 62, 63, 65, 67, 69, 70, 71, 
72, 73, 74, 75, 76, 78, 79, 80, 81, 82, 83, 86, 87, 88, 89, 90, 
91, 93, 94, 95, 98, 99, 100, 101, 109, 110, 111, 113, 114, 115, 

35116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 128, 
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129, 130, 131, 132, 133, 134, 135, 137, 143, 144, 145, 146, 
147, 148, 149, 150, 151, 152, 153, 154, 156, 157, 159, 161, 
163, 164, 165, 166, 167, 169, 170, 171, 173, 174, 175, 176, 
177a, 178, 179, 184, 185, 185b, 185c, 187, 188, 189, 192, 201, 
5202, 207, 208, 210, 215, 216, 217, 219, 221, 222, 223, 223a, 
224, 225, 226, 233, 234, 235, 236, 237, 239, 240, 241 and/or 
242, the numbering being in accordance with the structural 
amino acid numbering defined in Table 1. 
Altered polarity in active site 
10 !t has been shown for rat trypsin that the catalytic 

activity towards the substrate Succinly-Ala-Phe-Lys-AMC can be 
increased with a factor of 1.84 (^\ at /K cat ) by changing the 
active site polarity around Aspl02 through a substitution of 
the conserved Ser214 to Ala (McGrath et al. f Biochemistry 
15(1992) 31, 3059-3064). The substitution S214A causes the 
Ser214-OH group to be replaced by a water molecule, which again 
causes the change in polarivy around Aspl02. 

In accordance herewith, the invention further relates 
to a trypsin- like Fusarium protease, in which one or more amino 
20 acids around the active site has been substituted with an other 
amino acid sequence so as to obtain a change in the polarity 
around the active site. In this context it is contemplated that 
substituting the serine at position 214 in a parent trypsin- 
like Fusarium protease with another amino acid may make the 
25 enzyme more active. 

An example of a variant of the trypsin- like F. 
oxysporum protease according to this embodiment of the inven- 
tion comprises the substitution S214A. 

Removal of Glvcosylation Sites 
30 The concept is to remove critical or potential 

N-glycosylation sites near the binding cleft region as they 
might interfere with binding of substrate. For instance, a 
variant of a trypsin-like F . oxvsporum protease in which the 
N-glycosylation site 223a has been removed may result in an 
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improved substrate binding. More specifically the following 
substitutions are contemplated: N223aS,G,R,K 

Surface Loops near the Active Site 

It has been shown for trypsin (Hedstrom, L et al , ; 

5 Science (1992) 255, pp. 1249-1253) that surface loops near the 
active site influence its specificity and catalytic rate. Thus 
in a further aspect the present invention relates to a trypsin- 
like protease variant improved by substituting any of its 
surface loops near the active site with the corresponding 

10 surface loop from the trypsin- like JF\ oxysporum protease 

disclosed herein. 

In particular substituting any of the active site 
surface loops II, IV, VI, IX, X (in particular pos. 171-178 
thereof), and XIII (in particular pos. 215-221 thereof) defined 

15 above in any trypsin-like protease with a loop that is a) at 
least 60% homologous to the corresponding loop in the trypsin- 
like F. oxvsporum protease, or b) reacts with antibodies raised 
against the corresponding loop of the trypsin-like F. oxysporum 
protease and which recognizes the trypsin-like F . oxysporum 

20protease, is believed to improve properties of the trypsin-like 
protease in question. 

Of course the loop to be inserted may show a higher 
homology to that of the trypsin-like F . oxvsporum protease, for 
instance a homology of at least 80%, such as at least 85%, 90% 

25 or even at least 95% with that of the corresponding loop struc- 
ture of the trypsin-like F. oxvsporum protease. 

The loop structure of a trypsin-like protease which 
"corresponds to 11 a given loop structure of the trypsin-like £\ 
oxvsporum protease may easily be determined by comparison of 

30the three-dimensional structures of the trypsin-like protease 
in question with that of the trypsin-like F_= — oxysporum 
protease. 

The loop structure to be inserted may either be 
provided by substituting one or more amino acid residues of the 
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parent loop structure so as to result in the desired modifica- 
tion, or by substituting the entire loop. 

The properties a) and b) mentioned above is intended 
to be understood in a similar manner to that of properties i) 
Sand ii) defined above. 

It is believed that the parent trypsin-like protease 
to be modified in accordance with this aspect of the invention . 
may be derived from a variety of sources including mammals, 
vertebrates, insects, microorganism and the like. Examples of 
10 mammalian and bacterial trypsin-like proteases are apparent 
from Table 2 above. 

It should be noted that, according to the invention, 
any one of the modifications of the amino acid sequence 
indicated above for the various classes of trypsin-like 
15 protease variants may be combined with any one of the other 
modifications mentioned above /here appropriate. 

Methods of preparing trypsin- li.ke protease variants 

Sev- -3.1 methods for introducing mutations into genes 
are known in the art. After a brief discussion of cloning 
20 trypsin- like protease -encoding DNA sequences (which for 
instance encode functional analogous of the trypsin-like F\ 
oxvsporum protease disclosed herein) , methods for generating 
mutations at specific sites within the trypsin-like protease- 
encoding sequence will be discussed. 

25 Cloning a DNA sequence encoding a trypsin-li ke protease 

The DNA sequence encoding a parent trypsin-like 
protease may be isolated from any cell or microorganism 
producing the trypsin-like protease in question by various 
methods, well known in the art. First a genomic DNA and/ or cDNA 

30 library should be constructed using chromosomal DNA or messen- 
ger RNA from the organism that produces the trypsin-like 
protease to be studied. Then, if the amino acid sequence of the 
trypsin-like protease is known, homologous, labelled oligonu- 
cleotide probes may be synthesized and used to identify 
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trypsin-like protease-encoding clones from a genomic library 
prepared from the organism in question. Alternatively, a 
labelled oligonucleotide probe containing sequences homologous 
to a known trypsin-like protease could be used as a probe to 

5 identify trypsin-like protease-encoding clones, using hybrid- 
ization and washing conditions of lower stringency. 

Yet another method for identifying trypsin-like 
protease-producing clones would involve inserting fragments of 
genomic DNA into an expression vector, such as a plasmid, 

10 transforming trypsin-like protease-negative bacteria with the 
resulting genomic DNA library, and then plating the transformed 
bacteria onto agar containing a substrate for the trypsin-like 
protease thereby allowing clones expressing the trypsin-like 
protease to be identified. 

15 Alternatively, the DNA sequence encoding the enzyme 

may be prepared synthetically by established standard methods, 
e.g. the phosphoamidite method described by S.L. Beaucage and 
M.H. Caruthers, Tetrahedron Letters 22, 1981, pp. 1859-1869, or 
the method described by Matthes et al . , The EMBQ J. 3, 1984, 

20pp. 801-805. According to the phosphoamidite method, oligo- 
nucleotides are synthesized, e.g. in an automatic DNA synthe- 
sizer, purified, annealed, ligated and cloned in appropriate 
vectors . 

Finally, the DNA sequence may be of mixed genomic and 
25 synthetic, mixed synthetic and cDNA or mixed genomic and cDNA 
origin prepared by ligating fragments of synthetic, genomic or 
cDNA origin (as appropriate) , the fragments corresponding to 
various parts of the entire DNA sequence, in accordance with 
standard techniques. The DNA sequence may also be prepared by 
30 polymerase chain reaction (PCR) using specific primers, for 
instance as described in US 4,683,202 or R.K. Saiki et al., 
Science 239 , 1988, pp. 487-491. 

Site-directed mutagenesis 

Once a trypsin-like protease-encoding DNA sequence has 
35 been isolated, and desirable sites for mutation identified, 
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mutations may be introduced using synthetic oligonucleotides. 
These oligonucleotides contain nucleotide sequences flanking 
the desired mutation sites; mutant nucleotides are inserted 
during oligonucleotide synthesis. In a specific method, a 
5 single-stranded gap of DNA, bridging the trypsin-like protease- 
encoding sequence, is created in a vector carrying the trypsin- 
like protease gene. Then the synthetic nucleotide, bearing the 
desired mutation, is annealed to a homologous portion of the 
single- stranded DNA. The remaining gap is then filled in with 

10 DNA polymerase I (Klenow fragment) and the construct is ligated 
using T4 ligase. A specific example of this method is described 
in Morinaga et al. # (1984, Biotechnology 2:646-639). U.S. 
Patent number 4,760,025, by Estell et al . , issued July 26, 
1988, discloses the introduction of oligonucleotides encoding 

15 multiple mutations by performing minor alterations of the 
cassette, however, an even greater variety of mutations can be 
introduced at any one time by the Morinaga method, because a 
multitude of oligonucleotides, of various lengths, can be 
introduced . 

20 Another method of introducing mutations into trypsin- 

like protease -encoding sequences is described in Nelson and 
Long, Analytical Biochemistry 180 , 1989, pp. 147-151. It 
involves the 3 -step generation of a PCR fragment containing the 
desired mutation introduced by using a chemically synthesized 

25 DNA strand as one of the primers in the PCR reactions. From the 
PCR- generated fragment, a DNA fragment carrying the mutation 
may be isolated by cleavage with restriction endonucleases and 
reinserted into an expression plasmid. 

Express ion of trvosin-like protease variants 
30 According to the invention, a mutated trypsin-like 

protease -coding sequence produced by methods described above, 
or any alternative methods known in the art, can be expressed, 
in enzyme form, using an expression vector which typically 
includes control sequences encoding a promoter, operator, 
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ribosome binding site, translation initiation signal, and, 
optionally, a repressor gene or various activator genes. 

The recombinant expression vector carrying the DNA 
sequence encoding a trypsin-like protease variant of the 
5 invention encoding may be any vector which may conveniently be 
subjected to recombinant DNA procedures, and the choice of 
vector will often depend on the host cell into which it is to 
be introduced. Thus, the vector may be an autonomously repli- 
cating vector, i.e. a vector which exists as an extrachromo- 

lOsomal entity, the replication of which is independent of 
chromosomal replication, e.g. a plasmid, a bacteriophage or an 
extrachromosomal element, minichromosome or an artificial 
chromosome. Alternatively, the vector may be one which, when 
introduced into a host cell, is integrated into the host cell 

15genome and replicated together with the chromosome (s) into 
which it has been integrated. 

In the vector, the DNA sequence should be operably 
connected to a suitable promoter sequence. The promoter may be 
any DNA sequence which shows transcriptional activity in the 

20 host cell of choice and may be derived from genes encoding pro- 
teins either homologous or heterologous to the host cell. 
Examples of suitable promoters for directing the transcription 
of the DNA sequence encoding a trypsin-like protease variant of 
the invention, especially in a bacterial host, are the promoter 

25 of the lac operon of E.coli , the Streptomyces coelicolor aga- 
rase gene dagA promoters, the promoters of the Bacillu s li- 
cheniformis or-amylase gene ( amv L) , the promoters of the Ba- 
cillus stearothermophilus maltogenic amylase gene (amyM) , the 
promoters of the Bacillus Amyloliauef aciens a-amylase ( amv O) , 

30 the promoters of the Bacillus subtilis xylA and xylB genes etc. 
For transcription in a fungal host, examples of useful promo- 
ters are those derived from the gene encoding A. orvzae TAKA 
amylase, Rhizomucor miehei aspartic proteinase, A. niqer neu- 
tral of-amylase, A. niqer acid stable a-amylase, A. niqer glu- 

35coamylase, Rhizomucor miehei lipase, A. orvzae alkaline 
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protease, A. orvzae triose phosphate isomerase or A. nidulans 
acetamidase. 

The expression vector of the invention may also 
comprise a suitable transcription, terminator and, in 

Seukaryotes, polyadenylation sequences operably connected to the 
DNA sequence encoding the recombinant protease of the inven- 
tion. Termination and polyadenylation sequences may suitably be 
derived from the same sources as the promoter. 

The vector may further comprise a DNA sequence 

lOenabling the vector to replicate in the host cell in question. 
Examples of such sequences are the origins of replication of 
plasmids pUC19, pACYC177, pUBHO, pE194, pAMBl and pIJ702. 

The vector may also comprise a selectable marker, e.g. 
a gene the product of which complements a defect in the host 

15 cell, such as the dal genes from B.subtilis or B . lichenif ormis, 
or one which confers antibiotic resistance such as ampicillin, 
kanamycin, chloramphenicol or tetracyclin resistance. Fur- 
thermore, the vector may comprise Aspergillus selection markers 
such as amdS, argB, niaD and sC, a marker giving rise to 

20hygromycin resistance, or the selection may be accomplished by 
co-transformation, e.g. as described in WO 91/17243. 

While intracellular expression may be advantageous in 
some respects, e.g. when using certain bacteria as host cells, 
it is generally preferred that the expression is extracellular. 

25 As mentioned above the trypsin-like F - oxvsporum protease com- 
prising the amino acid sequence shown in the SEQ ID No. 2 
comprises a preregion permitting secretion of the expressed 
protease into the culture medium. If desirable, this preregion 
may be substituted with a different preregion or signal 

30 sequence, convenient accomplished by substitution of the DNA 
sequences encoding the respective preregions. 

The procedures used to ligate the DNA construct of the 
invention encoding a trypsin-like protease variant, the 
promoter, terminator and other elements, respectively, and to 

35 insert them into suitable vectors containing the information 
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necessar y for replication, are well known to persons skilled in 
the art (cf., for instance, Sambrook et al . (1989)). 

The cell of the invention either comprising a DNA con- 
struct or an expression vector of the invention as defined 

5 above is advantageously used as a host cell in the recombinant 
production of a trypsin-like protease variant of the invention. 
The cell may be transformed with the DNA construct of the 
invention encoding the variant, conveniently by integrating the 
DNA construct in the host chromosome. This integration is 

10 generally considered to be an advantage as the DNA sequence is 
more likely to be stably maintained in the cell. Integration of 
the DNA constructs into the host chromosome may be performed 
according to conventional methods, e.g. by homologous or 
heterologous recombination. Alternatively, the cell may be 

15 transformed with an expression vector as described below in 
connection with the different types of host cells. 

The cell of the invention may be a cell of a higher 
organism such as a mammal or an insect, but is preferably a 
microbial cell, e.g. a bacterial or a fungal (including yeast) 

20 cell. 

Examples of suitable bacteria are grampositive 
bacteria such as Bacillus subtilis , Bacillus lichenif ormis , 
Bacillus lentus . Bacillus brevis , Bacillus stearo thermophilus , 
Bacillus alkalophilus , Bacillus amvloliauef aciens , Bacillus 

25 coaoulans . Bacillus circulans . Bacillus lautus, Bacillus 

meaaterium . Bacillus t huringiensis, or Streptomyces lividans or 
Streptomyces murinus , or gramnegative bacteria such as E.coli. 
The transformation of the bacteria may for instance be effected 
by protoplast transformation or by using competent cells in a 

30 manner known per se . 

The yeast organism may favourably be selected from a 
species of Saccharomvces or Schizosaccharomvces , e.g. Saccharo- 
myces cerevisiae . The filamentous fungus may advantageously 
belong to a species of Aspergillus , e.g. Aspergillus oryzae or 

35 Aspergillus niger . Fungal cells may be' transformed by a process 
involving protoplast formation and transformation of the 
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protoplasts followed by regeneration of the cell wall in . a 
manner known per se . A suitable procedure for transformation of 
Aspergillus host cells is describe ■•?. in EP 238 023. 

In a yet further aspect, ..V.a present invention relates 
5 to a method of producing a trypsin-like protease variant of the 
invention, which method comprises cultivating a host cell as 
described above under conditions conducive to the production of 
the protease and recovering the protease from the cells and/or 
culture medium. 

10 The medium used to cultivate the cells may be any 

conventional medium suitable for growing the host cell in 
question and obtaining expression of the protease variant of 
the invention. Suitable media are available from commercial 
suppliers or may be prepared according to published recipes 

15 (e.g. in catalogues of the American Type Culture Collection) . 

The trypsin-like protease variant secreted from the 
host cells may conveniently be recovered from the culture 
medium by well-known procedures including separating the cells 
from the medium by centrifugation or filtration, and precipi- 

20tating proteinaceous components of the medium by means of a 
salt such as ammonium sulphate, followed by chromatographic 
procedures such as ion exchange chromatography, affinity 
chromatography, or the like. 

Detergent Compositions 

25 According to the invention, the protease variant may 
typically be a component of a detergent composition. As such, 
it may be included in the detergent composition in the form of 
a non-dusting granulate, a stabilized liquid, or a protected 
enzyme. Non-dusting granulates may be produced, e.g., as 

30 disclosed in US 4,106,991 and 4, 661, 452 (both to Novo Industri 
A/S) and may optionally be coated by methods known in the art. 
Examples of waxy coating materials are poly (ethylene oxide) 
products (polyethyleneglycol, PEG) with mean molecular weights 
of 1000 to 20000; ethoxylated nonylphenols having from 16 to 50 

35 ethylene oxide units; ethoxylated fatty alcohols in which the 
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alcohol contains from 12 to 20 carbon atoms and in which there 
are 15 to 80 ethylene oxide units; fatty alcohols; fatty acids; 
and mono- and di- and triglycerides of fatty acids. Examples of 
film- forming coating materials suitable for application by 
5 fluid bed techniques are given in patent GB 1483591. Liquid 
enzyme preparations may, for instance, be stabilized by adding 
a polyol such as propylene glycol, a sugar or sugar alcohol, 
lactic acid or boric acid according to established methods. 
Other enzyme stabilizers are well known in the art. Protected 
10 enzymes may be prepared according to the method disclosed in EP 
238,216. 

The detergent composition of the invention may be in any 
convenient form, e.g. as powder, granules, paste or liquid. A 
liquid detergent may be aqueous, typically containing up to 70% 

15 water and 0-30% organic solvent, or nonaqueous. 

The detergent composition comprises one or more surfactants, 
each of which may be anionic, nonionic, cationic, or zwitter- 
ionic. The detergent will usually contain 0-50% of anionic 
surfactant such as linear alkylbenzenesulf onate (LAS) , alpha- 

20olefinsulfonate (AOS) , alkyl sulfate (fatty alcohol sulfate) 
(AS) , alcohol ethoxysulfate (AEOS or AES) , secondary alkane- 
sulfonates (SAS) , alpha-sulfo fatty acid methyl esters, alkyl - 
or alkenylsuccinic acid, or soap. It may also contain 0-40% of 
nonionic surf actant such as alcohol ethoxylate (AEO or AE) , 

25 carboxylated alcohol ethoxylates, nonylphenol ethoxylate, 
alkylpolyglycoside, alkyldimethylamine oxide, ethoxylated fatty 
acid monoethanolamide, fatty acid monoethanolamide, or 
polyhydroxy alkyl fatty acid amide (e.g. as described in WO 
92/06154). 

30 The detergent composition may additionally comprise one or 
more other proteases as well as one or more other enzymes 
conventionally used in detergent compositions, such as an 
amylase, a lipase, a cutinase, a cellulase, a peroxidase, 
and/or an oxidase, e.g., a laccase. 

35 The detergent may contain 1-65% of a detergent builder or 
complexing agent such as zeolite, diphosphate, triphosphate, 
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phosphonate, citrate, nitrilotriacetic acid (NTA) , ethylene- 
diaminetetraacetic acid (EDTA) , diethylenetriaminepentaacetic 
acid (DTMPA) , alkyl- or alkenylsuccinic acid, soluble silicates 
or layered silicates (e.g. SKS-6 from Hoechst) . The detergent 
5 may also be unbuilt, i.e. essentially free of detergent 
builder. 

The detergent may comprise one or more polymers. Examples 
are carboxymethylcellulose (CMC) , poly (vinylpyrrolidone) (PVP) , 
polyethyleneglycol (PEG), poly (vinyl alcohol) (PVA) , poly- 

lOcarboxylates such as polyacrylates, maleic/acrylic acid 
copolymers and lauryl methacrylate/acrylic acid copolymers. 

The detergent may contain a bleaching system which may 
comprise a H 2 0 2 source such as perborate or percarbonate which 
may be combined with a peracid- forming bleach activator such as 

15 tetraacetylethylenediamine (TAED) or nonanoyloxybenzene- 
sulfonate (NOBS) . Alternatively, the bleaching system may 
comprise peroxyacids of, e.g., the amide, imide, or sulfone 
type. 

The enzymes of the detergent composition of the invention 
20 may be stabilized using conventional stabilizing agents, e.g. 
a polyol such as propylene glycol or glycerol, a sugar or sugar 
alcohol, lactic acid, boric acid, or a boric acid derivative 
such as, e.g., an aromatic borate ester, and the composition 
may be formulated as described in, e.g., WO 92/19709 and WO 
25 92/19708. 

The detergent may also contain other conventional detergent 
ingredients such as, e.g., fabric conditioners including clays, 
foam boosters, suds suppressors, anti- corrosion agents, soil- 
suspending agents, anti-soil-redeposition agents, dyes, 
30 bactericides, optical brighteners, or perfume. 

The pH (measured in aqueous solution at use concentration) 
will usually be neutral or alkaline, e,g. in the range of 7-11. 

Particular forms of detergent compositions within the scope 
of the invention include: 
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1) A detergent composition formulated as a granulate having 
bulk density of at least 600 g/1 comprising 



t ^«aqv al Itt rl Kon p 7ca - n^c:il"l ^ OTIS t & { Cell 
linear aUvyiDeuZcilcbuxiuiiauc; 

culated as acid) 


7 


12% 


Alcohol ethoxysulfate (e.g. C„. 18 
alcohol, 1-2 EO) or alkyl sulfate 
<e-g- c ia-ir> , — 


1 


4% 


Alcohol ethoxylate (e.g. C u . 15 alco- 
hol, 
7 EO) 


5 


9% 


Sodium carbonate (as Na ? CO,) 


14 


20% 


Qnlnhlp qiI irate (as Na^O,2SiO^) 


2 


6% 


Oolite (as NaAlSiO,,) 


15 


22% 


Sodium sulfate (as Na ? SOj 


0 


6% 


Sodium citrate/citric acid 
(as CANa,0 7 /CA0 7 ) 


0 


15% 


Sodium perborate (as NaBO,.H,0) 


11 


18% 


TAED 


2 


6% 


Carboxymethylcellulose 


0 


2% 


Polymers (e.g. maleic/acrylic acid 
copolymer, PVP, PEG) 


0 


3% 


Enzymes (calculated as pure enzyme 
protein) 


0.0001 


- 0.1% 


Minor ingredients (e.g. suds 
suppressors, perfume, optical 
I brightener, photobleach) 


0 


5% 



2) A detergent composition formulated as a granulate having 
bulk density of at least 600 g/1 comprising 



30 


Linear alkylbenzenesulfonate (cal- 
culated as acid) 


6 - 


11% 


Alcohol ethoxysulfate (e.g. C 12 . 1B 
alcohol, 1-2 EO or alkyl sulfate 
(e.g. C„. 1W ) 


1 - 


3% 


35 


Alcohol ethoxylate (e.g. C„. 15 alco- 
hol, 
7 EO) 


5 - 


9% 




Sodium carbonate (as Na ? C0,) 


15 - 


21% 



WO 95/30743 



PCT/DK95/00180 



45 



jQlUDie SlllCdLe \3.3 rJci^U , ZDlU^j 


1 - 4% 


Zeolite (as NaAlSiOj 


24 - 34% 


Sodium sulfate (as Na ? S0j 


4 - 10% 


Sodium citrate/citric acid 
(as CANa,0 7 /C A H ft 0 7 ) 


0 - 15% 


Carboxymethyl cellulose 


0-2% 


Polymers (e.g. maleic/acrylic acid 
copolymer, PVP, PEG) 


1-6% 


Enzymes (calculated as pure enzyme 
protein) 


0.0001 - 0.1% 


Minor ingredients (e.g. suds 
suppressors, perfume) 


0 - 5% 



3) A detergent composition formulated as a granulate having a 
bulk density of at least 600 g/1 comprising 



15 


Linear alkylbenzenesulf onate (cal- 
culated as acid) 


5 


- 9% 




Alcohol ethoxylate (e.g. C 12 . 15 alco- 
hol, 
7 EO) 


7 


- 14% 


20 


Soap as fatty acid (e.g. C 16-22 fatty, 
acid) 


1 


- 3% 




Sodium carbonate (as Na ? CQ,) 


10 


- 17% 




Soluble silicate (as NapO,2SiOp) 


3 


- 9% 




Zeolite (as NaAlSiOj 


23 


- 33% 


25 


Sodium sulfate (as Na 9 S04) 


0 


- 4% 




Sodium perborate (as NaBO^.HpO) 


8 


- 16% 




TAED 


2 


8% 




Phosphonate (e.g. EDTMPA) 


0 


- 1% 




Carboxymethylcellulose 


0 


- 2% 


30 


Polymers (e.g. maleic/acrylic acid 
copolymer, PVP, PEG) 


0 


- 3% 




Enzymes (calculated as pure enzyme 
protein) 


0.0001 


- 0.1% 


35 


Minor ingredients (e.g. suds 
suppressors, perfume, optical 
brightener) 


0 


- 5% 
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4) A detergent composition formulated as a granulate having a 
bulk density of at least 600 g/1 comprising 



I Linear alkylbenzenesulf onate (cal- 

| CUxaLcQ as aL. 1U j 


8 


- 12% 


Alcohol ethoxylate (e.g. C 12 . 15 alco- 
nox , 
7 EO) 


10 


- 25% 


QnHinm rarhnnatp fas Na«CO-») 


14 


- 22% 


Soluble silicate (as Na ? 0,2SiO ? ) 


1 


- 5% 


Zeolite (as NaAlSiOj 


25 


- 35% 


Sodium sulfate (as NapSO^) 


0 


- 10% 


Carboxymethylcellulose 


0 


- 2% ! 


Polymers (e.g. maleic/acrylic acid 
copolymer, PVP, PEG) 


1 


- 3% 


Enzymes (calculated as pure enzyme 
protein) 


0.0001 


- 0.1% 


Minor ingredients (e.g. suds 
suppressors , perfume ) 


0 


- 5% 


5) An aqueous liquid detergent composition comprising 


Linear alkylbenzenesulf onate (cal- 
culated as acid) 


15 


- 21% 


Alcohol ethoxylate (e.g. C 12 . 15 alco- 
hol, 

7 EO or C 15 . 1f; alcohol, 5 EO) 


12 


- 18% 


Soap as fatty acid (e.g. oleic 
acid) 


3 


- 13% 


Alkenyl succinic acid (C w . u ) 


0 


- 13% | 


Aminoethanol 


8 


- 18% 


Citric acid 


2 


- 8% 


Phosphonate 


0 


- 3% 


Polymers (e.g. PVP, PEG) 


0 


- 3% 


Borate (as B L 0 7 ) 


0 


- 2% 


Ethanol 


0 


- 3% 


Propylene glycol 


8 


- 14% 
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Enzymes (calculated as pure enzyme 
protein) 


. 0.0001 - 0.1% 


Minor ingredients (e.g. 
dispersants, suds suppressors, per- 
fume, optical brightener) 


0 -5% 



6) An aqueous structured liquid detergent composition compris- 



ing 



Linear alkylbenzenesulf onate 
(calculated as acid) 


15 


- 21% 


Alcohol ethoxylate (e.g. C l2 . 15 
alcohol, 

7 EO, or C 15 . iq alcohol, 5 EO) 


3 


- 9% 


Soap as fatty acid (e.g. oleic 
acid) 


3 


- 10%' 


Zeolite (as NaAlSiOJ 


14 


- 22% 


Potassium citrate 


9 


- 18% 


Borate (as B^0 7 ) 


0 


- 2% 


Carboxymethylcellulose 


0 


- 2% 


Polymers (e.g. PEG, PVP) 


0 


- 3% 


Anchoring polymers such as, e.g., 
lauryl methacrylate/acrylic acid 
copolymer; molar ratio 25:1; MW 
3800 


0 


- 3% 


Glycerol 


0 


- 5% 


Enzymes (calculated as pure enzyme 
protein) 


0.0001 


- 0.1% 


Minor ingredients (e.g. 
dispersants, suds suppressors, 
perfume, optical brighteners) 


0 


- 5% 



30 7) A detergent composition formulated as a granulate having a 



bulk density of at least 600 g/1 comprising 



Fatty alcohol sulfate 


5 - 10% 


Ethoxylated fatty acid monoethanol- 
amide 


3 - 9% 


Soap as fatty acid 


0 - 3% 


Sodium carbonate (as Na ? C0 3 ) 


5 - 10% 
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Soluble silicate (as Na ? 0, 2SiO ? ) 


1 - 4% 


Zeolite (as NaAisiu A ; 


20 - 40% 


Sodium sulfate (as Na^SOJ 


2 - 8% 


Sodium perborate (as NaBO, .H 7 0) 


12 - 18% 


TAED 


2 - 7% 


Polymers (e.g. maleic/acrylic acid 
copolymer, PEG) 


1 - 5% 


Enzymes (calculated as pure enzyme 
protein) 


0.0001 - 0.1% 


Minor ingredients (e.g. optical 
bright ener, suds suppressors, per- 
fume) 


0 - 5% 



10 



8) A detergent composition formulated as a granulate comprising 



15 



20 



25 



Linear alkylbenzenesulf onate 
(calculated as acid) 


8 


- 14% 


Ethoxylated fatty acid monoethanol- 
amide 


5 


- 11% 


Soap as fatty acid 


0 


- 3% 


Sodium carbonate (as Na ? CO^) 


4 


- 10% 


Soluble silicate (as Na P 0,2SiO ? ) 


1 


- 4% 


Zeolite (as NaAlSiOj 


30 


- 50% 


Sodium sulfate (as Na ? SO A ) 


3 


- 11% 


Sodium citrate (as C^Na^Oy) 


5 


- 12% 


Polymers (e.g. PVP, maleic/acrylic 
acid copolymer, PEG) 


1 


- 5% 


Enzymes (calculated as pure enzyme 
protein) 


0.0001 


- 0.1% 


Minor ingredients (e.g. suds 
suppressors , perfume ) 


0 


- 5% 


9) A detergent composition formulated as a granulate compr: 


Linear alkylbenzenesulf onate 
(calculated as acid) 


6 


- 12% 


Nonionic surfactant 


1 


- 4% 


Soap as fatty acid 


2 


- 6% 
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Sodium carbonate {as Na^CO,) 


14 


- 22% 


Zeolite (as NaAlSiOj 


18 


- 32% 


Sodium sulfate (as Na ? SO A ) 


5 


- 20* 


Sodium citrate (as C A H q Na,0 7 ) 


3 


- 8% 


Sodium perborate (as NaB0 3 .H ? 0) 


4 


- 9% 


Bleach activator (e.g. NOBS or 
TAED) 


1 


- 5% 


Carboxymethylcellulose 


0 


- 2% 


Polymers (e.g. polycarboxylate or 
PEG) 


1 


- 5% 


Enzymes (calculated as pure enzyme . 
protein) 


0.0001 


- 0.1% 


Minor ingredients (e.g. optical 
brightener, perfume) j 


0 


- 5% 



15 10) An aqueous liquid detergent composition comprising 



Linear alkylbenzenesulfonate 
(calculated as acid) 


. 15 


- 23% 


Alcohol ethoxysulf ate (e.g. C 12 . 15 
alcohol, 2-3 EO) 


8 


- 15% 1 


Alcohol ethoxylate (e.g. C 12 . 15 al- 
cohol, 7 EO, or C 12 . 15 alcohol, 5 
EO) 


3 


- 9% 


Soap as fatty acid (e.g. lauric 
acid) 


0 


- 3% 


Aminoethanol 


1 


- 5% 


Sodium citrate 


5 


- 10% 


Hydrotrope (e.g. sodium 
t oluensul f onate ) 


2 


- 6% 


Borate (as B L 0 7 ) 


0 


- 2% 


Carboxymethylcellulose 


0 


- 1% 


Ethanol 


1 


- 3% 


Propylene glycol 


2 


- 5% 


Enzymes (calculated as pure enzyme 
protein) 


0.0001 


- 0.1% 
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Minor ingredients (e.g. polymers, 
dispersants, perfume, optical 
bright eners) 


0 - 5% 


11) An aqueous liquid detergent composition comprising 


Linear alkylbenzenesulf onate 
(calculated as acid) 


| 

20 - 32% 


Alcohol ethoxylate (e.g. C 12 15 alco- 
hol, 

7 EO, or C 19 _ 1t; alcohol, 5 EO) 


6 - 12% 


Aminoethanol 


| 2 - 6% 


Citric acid 


8 - 14% 


Borate (as B L 0 7 ) 


1 - 3% 


Polymer (e.g. maleic/acrylic acid 
copolymer, anchoring polymer such 
as, e.g., lauryl 
methacrylate/acrylic acid 
copolymer) 


0 - 3% 


Glycerol 


3 - 8% 


Enzymes (calculated as pure enzyme 
protein) 


0.0001 - 0.1% 


Minor ingredients (e.g. hydro- 
tropes, dispersants, perfume, 
optical brighteners) 


0 - 5% 



12) A detergent composition formulated as a granulate having 
25a bulk density of at least 600 g/1 comprising 



Anionic surfactant (linear 
alkylbenzenesulf onate , alkyl sul fa- 
te, alpha-olef insulf onate, alpha - 
sulfo fatty acid methyl esters, 
alkanesulf onates , soap) 


25 


- 40% 


Nonionic surfactant (e.g. alcohol 
ethoxylate) 


1 


- 10% 


Sodium carbonate (as Na,CO x ) 


8 


- 25% 


Soluble silicates (as Na,0, 2S±0 ? ) 


5 


- 15% 


Sodium sulfate (as Na^SOj 


0 


- 5% 


Zeolite (as NaAlSiOj 


15 


- 28% 


Sodium perborate (as NaB0,.4H ? 0) 


0 


- 20% 
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Bleach activator (TAED or NOBS) 


0 - 5% 


Enzymes (calculated as pure enzyme 
protein) 


0.0001 - 0.1% 


Minor ingredients (e.g. perfume, 
optical brighteners) 


0 - 3% 



13) Detergent formulations as described in 1) - 12) wherein all 
or part of the linear alkylbenzenesulfonate is replaced by (C 12 - 
C 18 ) alkyl sulfate. 

14) A detergent composition formulated as a granulate having 
10a bulk density of at least 600 g/1 comprising 



(C„-C 1A ) alkyl sulfate 


9 


- 15% 


Alcohol ethoxylate 


3 


- 6% 


Polyhydroxy alkyl fatty acid amide 


1 


- 5% 


Zeolite (as NaAlSiOj 


10 


- 20% 


Layered disilicate (e.g. SK56 from 
Hoechst ) 


10 


- 20% 


Sodium carbonate (as Na ? CO,) 


3 


- 12% 


Soluble silicate (as Na^O^SiO,) 


0 


- 6% 


Sodium citrate 


4 


- 8% 


Sodium percarbonate 


13 


- 22% 


TAED 


3 


- 8% 


Polymers (e.g. polycarboxylates and 
PVP= 


0 


" 5% 


Enzymes (calculated as pure enzyme 
protein) 


0.0001 


- 0.1% 


Minor ingredients (e.g. optical 
bright ener, photo bleach, perfume, 
suds suppressors) 


0 


- 5% 



15) A detergent composition formulated as a granulate having 
30a bulk density of at least 600 g/1 comprising 
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Alcohol ethoxylate 


11 


- 15% 


Soap 


1 


- 4% 


Zeolite MAP or zeolite A 


35 


- 45% 


Sodium carbonate (as Na ? C0 7 ) 


2 


- 8% 


Soluble silicate (as Na.,0, 2Si0 7 ) 


n 
U 


A 2- 


Sodium percarbonate 


13 


- 22% 


TAED 


1 


8% 


Carboxymethyl cellulose 


0 


- 3% 


Polymers (e.g. polycarboxylates and 
PVP) 


0 


- 3% 


Enzymes (calculated as pure enzyme 
protein) 


0.0001 


- 0.1% 


Minor ingredients (e.g. optical 
brightener, phosphonate, perfume) 


0 


- 3% 



1516) Detergent formulations as described in 1) - 15) which 
contain a stabilized or encapsulated peracid, either as an 
additional component or as a substitute for already specified 
bleach systems. 

17) Detergent compositions as described in 1), 3), 7), 9) and 
2012) wherein perborate is replaced by percarbonate. 

18) Detergent compositions as described in 1), 3) , 7) , 9) , 12) , 
14) and 15) which additionally contain a manganese catalyst. 
The manganese catalyst may, e.g., be one of the compounds 
described in "Efficient manganese catalysts for low- temperature 

25bleaching", Nature 369, 1994, pp. 637-639. 

19) Detergent composition formulated as a nonaqueous detergent 
liquid comprising a liquid nonionic surfactant such as, e.g., 
linear alkoxylated primary alcohol, a builder system (e.g. 
phosphate) , enzyme and alkali. The detergent may also comprise 

30 anionic surfactant and/or a bleach system. 
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The protease variant of the invention may be incorporated 
in concentrations conventionally employed in detergents. It is 
at present contemplated that, in the detergent composition of 
the invention, the protease variant may be added in an amount 
5 corresponding to 0.00001-1 mg (calculated as pure enzyme 
protein) of protease variant per liter of wash liquor. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) N&ME: NOVO NORDISK A/S 
5 (B) STREET: Novo Alle 

(C) CITY: Bagsvaerd 

(E) 03GNTRY: DENMARK 

(F) POSTAL CODE (ZIP) : DK-2880 

(G) TELEPHCaSE: +45 44448888 
10 (H) TELEFAX: +45 4449 3256 

(I) TELEX: 37304 

(ii) TITLE OF INVENTION: Trypsin- like Fusarium protease 
(iii) NUMBER OF SEQUENCES: 2 

15 (iv) CCMPUIER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) CCMPUIER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFIWARE: Patent In Release #1.0, Version #1.25 (EPO) 

20 (2) INFORMATION FOR SEJQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 998 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) NDLECULE TYPE: cCNA 
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(iii) 



HYPOTHETICAL: NO 



(iii) 



ANTI -SENSE: NO 



(v) 



FRAGMENT TYPE: internal 



(vi) 



ORIGINAL SOURCE: 
(A) ORGANISM: Rosarium axysporum 



(B) STRAIN: DSM 2672 

(ix) FEATURE: 

(A) NAME/KEY: miscjEeature 

(B) LOCATION: 1..998 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
ATCATCAACC ACTCTTCACT CITCAACICT CCTCTCITOG ATATCIATCT CTTCACCAIG 
GTCAAGITCG CTTCOJICGT TCCACTTGTT GCTOCCCTGG CIGCIGC03C TCTTCAGGAG 
ATCOOCAACA TTCTETGGTOG CACTTCTOCC AG03CIQGC3 ACITBXJCTT CATOCTGEAGC 
ATTAGOOGCA AOGGT3GCCC CK3GICT3GA GGT1LTLTCC TCAAOGCCAA CACXCTCTIG 

15ACIGCIGQCC ACTOCCTITC OOGATACGCT CAGAGCGCTT TCCAGATTOG TGCTOGCAGT 
CICTCT03CA CITCTOCTGG TATTACCTCC TOSCTITCCr OOCTCAC^CT TCACCCTAGC 
TACAG03GAA ACAACAAOSA TCTT3CTATT CI^AGCTCT CTACITCCAT COCXTTCCGGC 
GGAAACATCG GCTATGCTOG OCTOGCIGCT TCOGGCTCIG ACCCTCT03C TQGATCTTCT 
GCCACIGITS CT3GCIGQGG OGCTACCTCT GAQQG03GCA GCTCTACTCC OIIX^ACCrr 

2DCIGAAGCTEA CICTCCCTAT CGTCTCTCJT GCIAGCIX30C GAGCTCAGTA 03GCACCT0C 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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GCCATCAOCA ACCAGATCTT CICTGCK3CT GTETIUITOCG GIQGCAAGGA CTCTTGCCAG 660 

GGK5£AG0G G03GCOCCAT CCTCGAGAGC TCO^ACACIC TTATOGGIGC TCIUIUITOG 720 

GGIAACGGAT GimXGAQC CAACIACICT GGICTCIATG CGAGOoTTOG TCCTCT003C 780 

TCTITCAJTG AOKCTATOC TIAAATACCT TCTTOGAAGC GIKjGAGATCT TCrTTGAATA 840 

5 TIUTCEAGCT TCACTCTK3G ATAOGAAACC TCTITGAGAA ATAQCJTTTCA ACGAGTTAAG 900 

AAGATATCAG TIGATTTCAG TIGGATCTIA GICCIGGTIG CTOGIAATAG AGCAATCIAG 960 

ATAGCQCAAA TTCAATATGA AATITCATCA AAATATIC 998 

(2) INFORMATION FDR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LEN3IH: 248 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

15 (v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: DSM 2672 

(ix) FEATORE: 
20 (A) NAME/KEY: Protein 

(B) LOCATION: 25 ,.248 

(ix) FEATURE: 
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(A) NAME/KEY: Peptide 

(B) LOCATION: 1..24 

(D) OTHER INFORMATION: /label= pre-prupeptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

5 Met Val Lys Phe Ala Ser Val Val Ala Leu Val Ala Pro Leu Ala Ala 

-20 -15 -10 

Ala Ala Pro Gin Glu He Pro Asn He Val Gly Gly Thr Ser Ala Ser 
-5 1 5 

Ala Gly Asp Phe Pro Phe He Val Ser He Ser Arg Asn Gly Gly Pro 
10 10 15 20 

Trp Cys Gly Gly Ser Leu Leu Asn Ala Asn Thr Val Leu Thr Ala Ala 
25 30 35 40 

His cys Val Ser Gly Tyr Ala Gin Ser Gly Phe Gin He Arg Ala Gly 
45 50 55 

15 Ser Leu Ser Arg Thr Ser Gly Gly He Thr Ser Ser Leu Ser Ser Val 
60 65 70 

Arg Val His Pro Ser Tyr Ser Gly Asn Asn Asn Asp Leu Ala He leu 
75 80 85 

Lys Lsu Ser Thr Ser He Pro Ser Gly Gly Asn He Gly Tyr Ala Arg 
20 90 95 100 

Leu Ala Ala Ser Gly Ser Asp Pro Val Ala Gly Ser Ser Ala Thr Val 
105 110 115 120 



Ala Gly Tip Gly Ala Thr Ser Glu Gly Gly Ser Ser Thr Pro Val Asn 
125 130 135 
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Leu Leu Lys Val Thr Val Pro He Val Ser Arg Ala Thr Cys Arg Ala 



140 



145 



150 



Gin Tyr Gly Thr Ser Ala lie Thr Asn Gin Met Phe Cys Ala Gly Val 
155 160 165 

5 Ser Ser Gly Gly Lys Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro He 
170 175 180 



Val Asp Ser Ser Asn Thr Leu lie Gly Ala Val Ser Trp Gly Asn Gly 

190 195 200 



185 



10 



Cys Ala Arg Pro Asn Tyr Ser Gly Val Tyr Ala Ser Val Gly Ala Leu 
205 210 215 



Arg Ser Phe He Asp Thr Tyr Ala 
220 
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CLAIMS 

1. A variant of a parent trypsin-like Fusarium 
protease comprising the amino acid sequence shown in the 
appended Sequence Listing ID No. 2, which 

5 i) reacts with an antibody raised against or reactive 

with at least one epitope of the parent trypsin-like protease, 

ii) is at least 60% homologous with the amino acid 
sequence of the parent trypsin-like protease, and/or 

iii) is encoded by a DNA sequence which hybridizes 
10 with an oligonucleotide probe hybridizing with the trypsin-like 

F. oxvsporum protease comprising the amino acid sequence shown 
in SEQ ID No. 2. 

2. A trypsin-like Fusarium protease variant according 
to claim 1, in which a naturally occurring amino acid residue 

15 (other than proline) of the amino acid sequence of the parent 
trypsin-like protease has been substituted with a proline re- 
sidue at one or more positions, at which positions (s) the 
dihedral angles <f> (phi) and ^ (psi) constitute values within 
the intervals [-9O°<0<-4O° and -180°<i|r<180 o ] , preferably within 

20the intervals [-90°«£<-40° and 120°<ifr<180 o ] or [-9O°<0<-4O° and 
-50°<^<10°] , and which position (s) is/are not located in re- 
gions in which the protease is characterized by possessing a- 
helical or fi-sheet structure. 

3. The variant according to claim 2, in which the 
25 amino acid residue (s) to be substituted with proline is a 

hydrophilic or a small hydrophobic amino acid residue, in 
particular one selected from the amino acid residues A, D, E, 
K, G, Q, R, S, T, N and V. 

4. The variant according to claim 2, in which one or 
30 more of the amino acid residues occupying the positions 21, 24, 

30, 46, 49, 71, 90, 111, 114, 121, 124, 125, 126, 132, 135, 
146, 148, 174, 175, 176, 178, 185b, 185c, 202, 208, and/or 209 
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of the parent trypsin-like protease (using the amino acid 
numbering defined in Table 1) has/have been substituted with 
proline residues. 

5. A trypsin-like Fusarium p rotease variant according 
5 to claim 1, in which an amino acid residue different from cy- 
steine of the amino acid sequence of the parent trypsin-like 
protease has been substituted with a cysteine residue in such 
a manner that the introduced cysteine residue together with 
another cysteine residue present in the parent protease or 

10 introduced therein form a disulphide bridge. 

6. The variant according to claim 5, in which the 
amino acid residues occupying the positions 22+157, 129+232 
and/or 136+201 of the parent trypsin-like protease (using the 
amino acid numbering defined in Table 1) have been substituted 

15 with cysteine residues. 

7. The variant according to claim 6 comprising the 
following substitutions: 

A22C+D26S+K157C, or 
A136C+D201C. 

20 8 . A trypsin-like Fusarium protease variant according 

to claim 1, in which either or both residues of any of the Asn- 
Gly sequence appearing in the amino acid sequence of the parent 
trypsin-like protease is/are deleted or substituted with a 
residue of a different amino acid. 

25 9 . The variant according to claim 8 , in which the Asn 

and/or Gly residue is substituted with a residue of an amino 
acid selected from the group consisting of A, Q, S, P, T and Y. 

10. The variant according to claim 8, in which any of 
the Asn or Gly residues of the Asn-Gly occupying positions 
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36+38 and/or 217+219 of the parent trypsin-like protease (using 
the amino acid numbering defined in Table 1) has/have been 
deleted or substituted with a residue of an amino acid selected 
from the group consisting of A, Q, S, P, T and Y. 

5 11. The variant according to claim 10 comprising one 

or more of the following substitutions: 
N36S, G38S, N217Y,S, G119S 

12. A trypsin-like Fusarium protease variant, in 
which one or more amino acid residues present in a loop 

10 structure of the parent trypsin-like protease suceptible to 
cleavage by a proteolytic enzyme is/are deleted or replaced 
with one or more amino acid residues so as to obtain a modified 
loop structure having an improved proteolytic stability. 

13. The variant according to claim 12 , in which the 
15 loop structure is suceptible to cleavage by a subtilisin or 

trypsin. 

14. The variant according to claim 12, in which one 
or more amino acid substitutions has/have been made so that the 
modified loop structure comprises at least one of the amino 

20 acid segments GAAG, GARG, YPGS, YPRS, HNRG, YTGN, ISSE, NNAG, 
SFIN, DQNG, ASFS, SRGV, LDTG, YYAA, INDI, WYFG, SIEN, GSTY, 
DSTN, PDLR, LDTG, GNRY, SGVM, RYPS, NGLV, SFSI, LGSP, RASF, 
VPWG, PDLN, SFVP, PDYR, PRLP, TVLP, IGTC, TGGT, TNKL, VGDV, 
IGVL, GSTY, RYAN, PNIP, or TLVP. 

25 15. The variant according to claim 12, in which loop 

II of the trypsin-like protease comprising the peptide sequence 
SRNGGP is substituted with loop II of the trypsin 2ptn isolated 
from Bos taurus comprising the peptide sequence NSGYH, as 
follows: 



30 S3 4N+R3 5S+N3 6 * +G3 9Y+ P4 OH , and/or 
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loop IV of the trypsin- like protease comprising the 
peptide sequence VSGYAQSGF is substituted with loop IV of the 
trypsin 2ptn isolated from Bos taurus comprising the peptide 
sequence YKSGI, as follows: 

5V59Y+S59a*+G59b*V*59c*+A60K+Q61S+S62*+F64I, and/or 

loop IV of the trypsin- like protease comprising the 
peptide sequence VSGYAQSGF is substituted with loop IV of the 
trypsin Itrm isolated from Rattus rattus comprising the peptide 
sequence YKSRI, as follows: 

10V59Y+S59a*+G59b*+Y59c*+A60K+Q61S+S62*+G63R+F64I, and/or 

loop VI of the trypsin- like protease comprising the 
peptide sequence HPSYSGN is substituted with loop VI of the 
trypsin ltrm isolated from Rattus rattus comprising the peptide 
sequence HPNFDRKTL, as follows: 

15S93N+F94Y+S95D+*96R+*97K+G98T+N99L. 

16. A trypsin-like Fusarium protease variant accord- 
ing to claim 1, in which an autoproteolysis site of the parent 
trypsin-like protease has been removed. 

17. The variant according to claim 16, in which an 
20Arg and/or Lys residue of the parent trypsin-like protease has 

been substituted with a non or less positively charged amino 
acid residue, or in which an Arg residue has been substituted 
with a Lys residue. 

18. The variant according to claim 17, in which the 
25 non or less positively charged amino acid residue is selected 

from the group consisting of K, S, V, P, E, D, N, Q, A and G. 
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19. The variant according to claim 16, in which the 
Lys and/or Arg residue at the following positions have been 
substituted: 

35, 67, 73, 89, 107, 122, 157, 165, 169, 188, 222, 235, using 
5 the structural sequence numbering def ined in Table 1. 

20. The variant according to claim 19, comprising one 
or more of the following substitutions: 

R35K,S 
R122K,V,P 
10R169K,V 

R122K,V,P+A124P 
R122V, P+A124P+T208V 

21. A trypsin-like Fusarium protease variant accord- 
ing to claim 1, in which one or more amino acid residues 

15 susceptible to oxidation has/have been replaced with another 
amino acid residue less susceptible to oxidation. 

22. The variant according to claim 21, in which the 
amino acid residue less susceptible to oxidation is selected 
from the group consisting of A, E, N, Q, I, L, S and K. 

20 2 3 . The variant according to claim 21 comprising one 

or more of the following substitutions: 

M180N,Q,E,K, using the sequence numbering of Table 1. 

24. A trypsin-like Fusarium protease variant accord- 
ing to claim 1, in which a N-glycosylation site has been intro- 
25duced at an amino acid residue X located in a loop structure on 
the surface of the protein subject to proteolysis by changing 
the sequence segment X-Y-Z to Asn-Yl-Ser or Asn-Yl-Thr, pro- 
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vided that Yl is different from Gly, so as to confer to the 
variant an improved proteolysis resistance. 

25. The variant according to claim 24, in which X is 
the amino acid residue located in the position 17 , 18, 20, 21, 
523, 24, 25, 34, 35, 36, 38, 39, 48, 49, 50, 59a, 59b, 59c, 60, 
61, 62, 63, 69, 70, 71, 72, 73, 74, 78, 79, 80, 81, 86, 87, 88, 
89, 90, 91, 92, 93, 94, 95, 98, 99, 109, 110, 111, 113, 114, 
115, 116, 117, 118, 121, 123, 124, 125, 126, 127, 128, 129, 
130, 131, 132, 133, 134, 143, 144, 145, 146, 147, 148, 149, 
10150, 151, 152, 153, 154, 163, 164, 165, 173, 174, 175, 176, 
177a, 178, 184, 185, 185b, 187, 188, 201, 202, 203, 207, 215, 
216, 217, 221, 222, 223, 224, 225, 233, 234, 235 or 236 of the 
parent trypsin-like protease using the amino acid numbering de- 
fined in Table 1. 

15 26. The variant according to claim 24, in which the 

residue Y is Lys or Arg. 

27. The variant according to claim 26, comprising one 
or more of the following substitutions : 

S34N+R35R, K+N36S, T 
20N36N+G38S+G39S,T 

S72N+R73R, K+T74T, S 

V88N+R89R, K+V90S, T 

A121N+R122R, K+L123S, T 

LI 2 3N+ A2 2 4 A+ A2 2 5 S , T 
25S164N+R165R,K+A166S,T 

28. A trypsin-like Fusarium protease variant accord- 
ing to claim 1, in which the net electrostatic charge of the 
parent trypsin-like protease has been changed by deleting or 
substituting one or more negatively charged amino acid residues 

30 by neutral or positively charged amino acid residue (s), and/or 
by substituting one or more neutral amino acid residues by 
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positively or negatively charged amino acid residue (s) , and/or 
by deleting or substituting one or more positively charged 
amino acid residue (s) by neutral or negatively charged amino 
acid residue (s) . 

5 29. A variant according to claim 28, which has a 

lower pi as compared to the pi of its parent protease, 

30. A variant according to claim 28, which has a 
higher pi as compared to that of its parent protease. 

31. The variant according to claim 30, in which the 
10 amino acid residue occupying one or more of the positions 17, 

18, 20, 21, 23, 24, 25, 26, 27, 34, 35, 36, 38, 39, 40, 41, 48, 
49, 50, 59a, 59b, 59c, 60, 61, 62, 63, 65, 67, 69, 70, 71, 72, 
73, 74, 75, 76, 78, 79, 80, 81, 82, 83, 86, 87, 88, 89, 90, 91, 
93, 94, 95, 98, 99, 100, 101, 109, 110, 111, 113, 114, 115, 

15116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 128, 
129, 130, 131, 132, 133, 134, 135, 137, 143, 144, 145, 146, 
147, 148, 149, 150, 151, 152, 153, 154, 156, 157, 159, 161, 
163, 164, 165, 166, 167, 169, 170, 171, 173, 174, 175, 176, 
177a, 178, 179, 184, 185, 185b, 185c, 187, 188, 189, 192, 201, 

20202, 207, 208, 210, 215, 216, 217, 219, 221, 222, 223, 223a, 
224, 225, 226, 233, 234, 235, 236, 237, 239, 240, 241 or 242 of 
the parent trypsin- like protease (using the amino acid number- 
ing defined in Table 1) have been deleted or substituted with 
a neutral, negative or positive hydrophilic amino acid, 

25 preferably selected from E, D, K, R, S, A, P or T. 

32. The variant according to claim 28, in which one 
or more amino acids around the active site has been substituted 
with an other amino acid sequence so as to obtain a change in 
the polarity around the active site. 



30 33. The variant according to claim 32, in which the 

amino acid sequence occupying the position 214 of the parent 
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trypsin-like protease (using the amino acid numbering defined 
in Table 1) has been substituted with another amino acid. 

34. The variant according to . claim 33, in which the 
following substitution has been made: 

5S214A. 

35. A variant of a parent trypsin-like protease, in 
which at least one of the loop structures II, IV, VI, IX, X and 
XIII has been substituted with a loop structure, which 

a) is at least 60% homologous to the corresponding 
10 loop structure in the trypsin-like Fusarium protease comprising 

the amino acid sequence shown in SEQ ID No. 2, and/or 

b) reacts with an antibody raised against the 
corresponding loop structure of the trypsin-like Fusarium 
protease comprising the amino acid sequence SEQ ID No. 2 and 

15 which recognizes said trypsin-like Fusarium protease. 

36. The variant according to claim 35, in which the 
parent trypsin-like protease is derived from a microorganism, 
a mammal, a vertebrate or an insect. 

37. The variant according to claim 36, in which the 
20parent trypsin-like protease is derived from any of the 

mammalian trypsin-like protease listed in Table 2 herein, or is 
a trypsin-like protease derivable from the bacterial species 
Streptomvces , in particular from S. ariseus . 

38. A DNA construct comprising a DNA sequence 
25 encoding a trypsin-like protease variant according to any of 

claims 1-37. 

t 

39. A recombinant expression vector which carries a 
DNA construct according to claim 38. 
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40. A cell which is transformed with a DNA construct 
according to claim 38 or a vector according to claim 39. 

41. A cell according to claim 40, which is a microbi- 
al cell. 

5 42. A cell according to claim 41 which is a bacterial 

cell or a fungal cell. 

43. A cell according to claim 42, in which the 
bacterial cell is a cell of a gram-positive bacterium, e.g. of 
the genus Bacillus or Streptomvces or a cell of a gram-negative 

10 bacterium, e.g. of the genus Escherichia, and the fungal cell 
is a yeast cell, e.g. of the genus Sarcharomvces, or a cell of 
a filamentous fungus, e.g. of the genus Aspergillus or Fu^ 
sarium . 

44. A method of producing a variant of a trypsin-like 
15 protease according to any of claims 1-37, wherein a cell accor- 
ding to any of claims 40-43 is cultured under conditions 
conducive to the production of the variant, and the variant is 
subsequently recovered from the culture. 

45. A detergent additive comprising a trypsin-like 
20 protease variant according to any of claims 1-37, optionally in 

the form of a non-dusting granulate, stabilised liquid or 
protected enzyme. 

46. A detergent additive according to claim 45 which 
contains 0.02-200 mg of enzyme protein/g of the additive. 

25 47. A detergent additive according to claim 45, which 

additionally comprises another enzyme such as a lipase, a 
protease, amylase, peroxidase and/or cellulase. 
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